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1 AAAATGTATG GATACAACTT ACGTTTGATG AAAGATTTGG GCTTGAAGAC CCAGAAGATG 
TTTTACATAC CTATGTTGAA TGCAAACTAC TTTCTAAACC CGAACTTCTG GGTCTTCTAC 

61 ACATATGCAA G7ATGATTT7 GTAGAAGTTG AGGAACCCAG TGATGGAACT ATATTAGGGC 
TGTATACGTT CATACTAAAA CATCTTCAAC TCCTTGGGTC ACTACCTTGA TATAATCCCG 

121 GCTGGTGTGG TTCTGGTACT GTACCAGGAA AACAGATTTC TAAAGGAAAT CAAATTAGGA 
CGACCACACC AAGACCATGA CATGGTCCTT TTGTCTAAAG ATTTCCTTTA GTTTAATCCT 

+ 1 MetAsn IlePheLeu LeuAsnLeuLeu ThrGluGlu ValArgLeu 
j 

181 TAAGATTTGT ATCTGATGAA TATTTTCCTT CTGAACCTTC TAACAGAGGA GGTAAGATTA 
ATTCTAAACA TAGACTACTT ATAAAAGGAA GACTTGGAAG ATTGTCTCCT CCATTCTAAT 

+1 TyrSerCysThr ProArgAsn PheSerVal SerlleArgGlu GluLeuLys ArgThrAsp 



2 41 TACAGCTGCA CACCTCGTAA CTTCTCAG7G TCCATAAGGG AAGAACTAAA GAGAACCGAT 
ATGTCGACGT GTGGAGCATT GAAGAGTCAC AGGTATTCCC TTCTTGATTT CTCTTGGCTA 

+1 ThrllePheTrp ProGlyCys LeuLeuVal LysArgCysGiy GlyAsnCys AlaCysCys 



301 ACCATTTTCT GGCCAGGTTG TCTCCTGGTT AAACGCTGTG GTGGGAACTG TGCCTGTTGT 
TGGTAAAAGA CCGGTCCAAC AGAGGACCAA TTTGCGACAC CACCCTTGAC ACGGACAACA 

+1 LeuHisAsnCys AsnGluCys GlnCysVal ProSerLysVal ThrLysLys TyrHisGlu 



3 61 CTCCACAATT GCAATGAATG TCAATGTGTC CCAAGCAAAG TTACTAAAAA ATACCACGAG 
GAGGTGTTAA CGTTACTTAC AGTTACACAG GGTTCGTTTC AATGATTTTT TATGGTGCTC 

+1 ValLeuGlnLeu ArgProLys ThrGlyVal ArgGlyLeuKis LysSerLeu ThrAspVal 



421 GTCCTTCAGT TGAGACCAAA GACCGGTGTC AGGGGATTGC ACAAATCACT CACCGACGTG 
CAGGAAGTCA ACTCTGGTTT CTGGCCACAG TCCCCTAACG TGTTTAGTGA GTGGCTGCAC 

+1 AlaLeuGluHis KisGluGiu CysAspCys ValCysArgGly SerThrGly Gly 



481 GCCCTGGAGC ACCATGAGGA GTGTGAC7GT GTGTGCAGAG GGAGCACAGG AGGATAGCCG 
CGGGACCTCG 7GG7ACTCC7 CACACTGACA CACACGTCTC CCTCGTGTCC TCCTATCGGC 

541 CATCACCACC AGCAGCTCT? GCCCAGAGC? GTGCAGTGCA GTGGCTGATT CTATTAGAGA 
GTAGTGGTGG TCGTCGAGAA CGGGTCTCGA CACG7CACGT CACCGACTAA GATAATCTCT 

601 ACGTATGCGT TATCTCCATC CTTAATCTCA GTTG7TTGCT 7CAAGGACCT TTCATCTTCA 
TGCATACGCA ATAGAGGTAG GAATTAGAGT CAACAAACGA AGTTCCTGGA AAGTAGAAGT 

661 GGATTTACAG TGCATTCTGA AA G A GG AG AC ATCAAACAGA ATTAGGAGTT GTGCAACAGC 
CCTAAATGTC ACGTAAGACT 77C7CC7C7G TAGT7TGTCT 7AATCCTCAA CACGTTGTCG 

721 TCTTTTGAGA GGAGGCCTAA AGGACAGGAG AAAAGG7C7T CAATCGTGGA AAG AAAATT A 
AGAAAACTCT CC7CCGGA77 TCCTGTCCTC TT77CCAGAA G7TAGCACCT TTCTTTTAAT 

781 AATGTTGTAT TAAA7AGATC ACCAGCTAGT TTCAGAGTTA CCATGTACGT ATTCCACTAG 
TTACAACATA AT77ATC7AG 7GGTCGA7CA AAGTCTCAAT GGTACATGCA TAAGGTGATC 
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8 41 CTGGGTTCTG TATTTCAGTT CTTTCGATAC GGCTTAGGGT AATGTCAGTA CAGGAAAAAA 

GACCCAAGAC ATAAAGTCAA GAAAGCTATG CCGAATCCCA TTACAGTCAT GTCCTTTTTT 

9 01 ACTGTGCAAG TGAGCACCTG ATTCCGTTGC CTTGCTTAAC TCTAAAGCTC CATGTCCTGG 

TGACACGTTC ACTCGTGGAC TAAGGCAACG GAACGAATTG AGATTTCGAG GTACAGGACC 

9 61 GCCTAAAATC GTATAAAATC TGGATTTTTT TTTTTTTTTT TGCTCATATT CACATATGTA 
CGGATTTTAG CATATTTTAG ACCTAAAAAA AAAAAAAAAA ACGAGTATAA GTGTATACAT 

1021 AACCAGAACA TTCTATGTAC TACAAACCTG GTTTTTAAAA AGGAACTATG TTGCTATGAA 
TTGGTCTTGT AAGATACATG ATGTTTGGAC CAAAAATTTT TCCTTGATAC AACGATACTT 

10 81 TTAAACTTGT GTCGTGCTGA TAGGACAGAC TGGATTTTTC ATATTTCTTA TTAAAATTTC 
AATTTGAACA CAGCACGACT ATCCTGTCTG ACCTAAAAAG TATAAAGAAT AATTTTAAAG 

1141 TGCCATTTAG AAGAAGAGAA CTACATTCAT GGTTTGGAAG AGATAAACCT GAAAAGAAGA 
ACGGTAAATC TTCTTCTCTT GATGTAAGTA CCAAACCTTC TCTATTTGGA CTTTTCTTCT 

12 01 GTGGCCTTAT CTTCACTTTA TCGATAAGTC AGTTTATT7G TTTCATTGTG TACATTTTTA 
CACCGGAATA GAAGTGAAAT AGCTATTCAG TCAAATAAAC AAAGT AAC AC ATGTAAAAAT 

12 61 TATTCTCCTT TTGACATTAT AACTGTTGGC TTTTCTAATC TTGTTAAATA TATCTATTTT 

ATAAGAGGAA AACTGTAATA TTGACAACCG AAAAGATTAG AACAATTTAT ATAGATAAAA 

13 21 TACCAAAGGT ATTTAATATT CTTTTTTATG ACAACTTAGA TCAACTATTT TTAGCTTGGT 

ATGGTTTCCA TAAATTATAA GAAAAAATAC TGTTGAATCT AGTTGATAAA AATCGAACCA 

13 81 AAATTTTTCT AAACACAATT GTTATAGCCA GAGGAACAAA GATGATATAA AATATTGTTG 
TTTAAAAAGA TTTGTGTTAA CAATATCGGT CTCCTTGTTT CTACTATATT TTATAACAAC 

1441 CTCTGACAAA AATACATGTA TTTCATTCTC GTATGGTGCT AG AG TT AG AT TAATCTGCAT 
GAGACTGTTT TTATGTACAT AAAGTAAGAG CATACCACGA TCTCAATCTA ATTAGACGTA 

15 01 TTTAAAAAAC TGAATTGGAA TAGAATTGGT AAGTTGCAAA GACTTTTTGA AAATAATTAA 
AAATTTTTTG ACTTAACCTT ATCTTAACCA TTCAACGTTT CTGAAAAACT TTTATTAATT 

15 61 ATTATCATAT CTTCCATTCC TGTTATTGGA GATGAAAATA AAAAGCAACT TATGAAAGTA 
TAATAGTATA GAAGGTAAGG ACAATAACCT CTACTTTTAT TTTTCGTTGA ATACTTTCAT 

1521 GACATTCAGA TCCAGCCATT ACTAACCTAT TCCTTTTTTG GGGAAATCTG AGCCTAGCTC 
CTGTAAGTCT AGGTCGGTAA TGATTGGATA AGGAAAAAAC CCCTTTAGAC TCGGATCGAG 

15 81 AGAAAAACAT AAAGCACCTT G AAAAAG ACT TGGCAGCTTC CTGATAAAGC GTGCTGTGCT 
TCTTTTTGTA TTTCGTGGAA CTTTTTCTGA ACCGTCGAAG GACTATTTCG CACGACACGA 

17 41 GTGCAGTAGG AACACATCCT ATTTATTGTG ATGTTGTGGT TTTATTATCT TAAACTCTGT 
CACGTCATCC TTGTGTAGGA TAAATAACAC TACAACACCA AAATAATAGA ATTTGAGACA 

1801 TCCATACACT TGTATAAATA CATGGATATT TTTATGTACA GAAGTATGTC TCTTAACCAG 
AGGTATGTGA ACATATTTAT GTACCTATAA AAATACATGT CTTCATACAG AGAATTGG7C 

1861 TTCACTTATT GTACCTGG 
AAGTGAATAA CATGGACC 



r 
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Predicted VEGF-Iike protein encoded by Incyte contig of 8/12/98 

1 MNIFLLNLLT E3VRLYSCTP RNTSVSIREE LKRTDTIFW? GCLLVKRCGG 
51 NCACCLHNCN ECQCVPS5CVT KXYHEVLQLR PKTGVRGLHK SLTDVALEHH 
101 EECDCVCRGS TGG 



PCR primers for cloning VEGF-X 



vegfXl AAAATGTATGGATACAACTTAC 

vegfX2 GTTTGATGAAAGATTTGGGCTTG 

vegfX3 TTTCTAAAGGAAATCAAATTAG 

vegfX4 GATAAGATTTGTATCTGATG 

vegfX5 GATGTCTCCTCTTTCAG 

vegfXo GCACAACTCCTAATTCTG 

vegfX7 AGCACCTGATTCCGTTGC 

vegfX8 TAGTACATAGAATGTTCTGG 

vegfX9 AAG AG AC ATAC TTCTGTAC 

vegfX 1 0 C C AG G TAC AAT AAG TG AACTG 
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JRF) 



Variants Isolated by PCR (at 8/2/99, all cloned and 



sequenced at 



a b 



PCR primers- 



c d 



e f 



Incyte contig 
(8/12/98) 



clone 22, 29, 41 



clone 52, 59 



clone 15, 20 



clones 57, 25, 
26, 27 



2.1kb clones 1, 
2, 3 



primers- a - vegfXl 
(see fig 3) d- vegfXS 



b- vegfX2 
e- vegfX9 



c- vegfX5 
f- vegfXIO 
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VEGF-X 5' RACE primers 

vegfXl 1 CCTTTAGAAATCTGTTTTCCTGGTACAG 
vegfXl2 GGAAAATATTCATCAGATACAAATCTTATCC 
vegfXl3 GGTCCAGTGGCAAAGCTGAAGG 
vegfXU CTGGTTCAAGATATCGAATAAGGTCTTCC 
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/^/cr! & DNA sequence assembled from in-house clones and 5'RACE 



1 TGCCAGAGCA GGTGGGCGCT TCCACCCCAG TGCAGCCT7C CCCTGGCGGT GGTGAAAGAG 
ACGGTCTCGT CCACCCGCGA AGGTGGGGTC ACG7CGGAAG GGGACCGCCA CCACTTTCTC 

61 ACTCGGGAGT CGCTGCTTCC AAAGTGCCCG CCGTGAGTGA GCTCTCACCC CAGTCAGCCA 
TGAGCCCTCA GCGACGAAGG TTTCACGGGC GGCACTCACT CGAGAGTGGG GTCAGTCGGT 

+2 MetSerLeu PheGlyLeuLeu LeuLeuThr SerAlaLeu AlaGlyGlnArg GlnGlyTh 
] 

121 AATGAGCCTC TTCGGGCTTC " TCCTGCTGAC ATCTGCCCTG GCCGGCCAGA GACAGGGGAC 
TTACTCGGAG AAGCCCGAAG AGGACGACTG TAGACGGGAC CGGCCGGTCT CTGTCCCCTG 

+2 rGlnAlaGlu SerAsnLeuSer SerLysPhe GlnPheSer SerAsnLysGlu GlnAsnGl 

181 TCAGGCGGAA TCCAACCTGA GTAGTAAATT CCAGTTTTCC AGCAACAAGG AACAGAACGG 
AGTCCGCCTT AGGTTGGACT CATCATTTAA GGTCAAAAGG TCGTTGTTCC TTGTCTTGCC 

+2 yValGlnAsp ProGlnHisGlu Argllelie ThrValSer ThrAsnGlySer IleHisSe 

241 AGTACAAGAT CCTCAGCATG AGAGAAT7A7 TACTGTGTCT ACTAATGGAA GTATTCACAG 
TCATGTTCTA GGAGTCGTAC TCTCTTAA7A ATGACACAGA TGATTACCTT CATAAGTGTC 

+2 rProArgPhe ProHisThrTyr ProArgAsn ThrValLeu ValTrpArgLeu ValAlaVa 

3 01 CCCAAGGTTT CCTCATACTT ATC CAAG AAA TACGGTCTTG GTATGGAGAT TAGTAGCAGT 
GGGTTCCAAA GGAGTATGAA TAGGTTC777 ATGCCAGAAC CATACCTCTA ATCA7CGTCA 

+2 lGluGluAsn ValTrpIleGln LeuThrPhe AspGluArg PheGlyLeuGlu AspProGl 

3 61 AGAGGAAAAT GTATGGATAC AACTTACG77 TGATGAAAGA TTTGGGCTTG AAGACCCAGA 

TCTCCTTTTA CATACCTATG TTGAATGCAA ACTACTTTCT AAACCCGAAC T.TCTGGGTCT 

+2 uAspAspIle CysLysTyrAsp PheValGlu ValGlcGlu ProSerAspGly ThrlleLe 

421 AGATGACATA TGCAAGTATG ATTTTGTAGA AGT7GAGGAA CCCAGTGATG GAACTATATT 
TCTACTGTAT ACGTTCATAC TAAAACA7C7 TCAACTCCTT GGGTCACTAC CTTGATATAA 

+ 2 uGlyArgTrp CysGlySerGly ThrVaLPro GiyLysGin IleSerLysGIy AsnGinll 

4 31 AGGGCGCTGG TGTGGTTCTG GTACTGTACC AGGAAAACAG ATTTCTAAAG GAAATCAAAT 

TCCCGCGACC ACACCAAGAC CATGACA7G3 TCC7777G7C TAAAGATTTC CTTTAGTTTA 

+2 eArglleArg PheValSerAsp GluTyrPhe ProSerGIu ProGlyPheCys IleHisTy 

541 TAGGATAAGA TTTGTATCTG ATGAA7A777 TCCTTC7GAA CCAGGG7TCT GCATCCACTA 
ATCCTATTCT AAAC AT AG AC TACTTATAAA AGGAAGAC7T GGTCCCAAGA CGTAGGTGAT 

+2 rAsnlleVal MetProGlnPhe ThrGluAla ValSer Pro SerValLeuPro ProSerAl 

f^Ol CAACATTGTC ATGCCACAAT TCACAGAAOC 'VC,T( ',MV\\:CV Tt'AC ITGCTAC CCCC.TTCAGC 
GTTGTAACAG TACGGTGTTA AGTGTCT7CG ACACTCAGGA AGTCACGATG GGGGAAGTCG 

+■2 aLeuProLeu AspLeuLeuAsn AsnAlalle ThrAlaPhe SerThrLeuGlu AspLeuIl 

661 TTTGCCACTC GACCTGCTTA ATAATCCTA7 AACTGCCTTT ACTACCTTGC AAGACCTTAT 
AAACGGTGAC CTGGACGAAT TATTACGA7A TTGACGGAAA TCATGGAACC TTCTGGAATA 
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+ 2 eArgTyrLeu GluProGluArg TrpGlnLeu AspLeuGlu AspLeuTyrArg ProThrTr 

721 TCGATATCTT GAACCAGAGA GATGGCAGTT GGACTTAGAA GATCTATATA GGCCAACTTG 
AGCTATAGAA CTTGGTCTCT CTACCGTCAA CCTGAATCTT C TAG AT AT AT CCGGTTGAAC 

+2 pGlnLeuLeu GlyLysAlaPhe ValPheGly ArgLysSer ArgValValAsp LeuAsnLe 

7 81 GCAACTTCTT GGCAAGGCTT TTGTTTTTGG AAGAAAATCC AGAGTGGTGG ATCTGAACCT 
CGTTGAAGAA CCGTTCCGAA AACAAAAACC TTCTTTTAGG TCTCACCACC T AG ACTTG G A 

+2 uLeuThrGlu GluValArgLeu TyrSerCys ThrProArg AsnPheSerVal SerlleAr 

841 TCTAACAGAG GAGGTAAGAT TATACAGCTG CACACCTCGT AACTTCTCAG TGTCCATAAG 
AGATTGTCTC CTCCATTCTA ATATGTCGAC GTGTGGAGCA TTGAAGAGTC ACAGGTATTC 

+2 gGluGluLeu LysArgThrAsp ThrllePhe TrpProGly CysLeuLeuVal LysArgCy 

901 GGAAGAACTA AAGAGAACCG ATACCATTTT CTGGCCAGGT TGTCTCCTGG TTAAACGCTG 
CCTTCTTGAT TTCTCTTGGC TATGGTAAAA GACCGGTCCA ACAGAGGACC AATTTGCGAC 

+2 sGlyGlyAsn CysAlaCysCys LeuHisAsn CysAsnGlu CysGlnCysVal ProSerLy 

9 61 TGGTGGGAAC TGTGCCTGTT GTCTCCACAA TTGCAATGAA TGTCAATGTG TCCCAAGCAA 
ACCACCCTTG ACACGGACAA CAGAGGTGTT AACGTTACTT ACAGTTACAC AGGGTTCGTT 

+2 sValThrLys LysTyrHisGlu ValLeuGln LeuArgPro LysThrGlyVal ArgGlyLe 

1021 AGTTACTAAA AAATACCACG AGGTCCTTCA GTTGAGACCA AAGACCGGTG TCAGGGGATT 
TCAATGATTT TTTATGGTGC TCCAGGAAGT CAACTCTGGT TTCTGGCCAC AGTCCCCTAA 

+2 uHisLysSer LeuThrAspVal AlaLeuGlu HlsHlsGLu GluCysAspCys ValCysAr 

10 81 GCACAAATCA CTCACCGACG TGGCCCTGGA GCACCATGAG GAGTGTGACT GTGTGTGCAG 
CGTGTTTAGT GAGTGGCTGC ACCGGGACCT CGTGGTACTC CTCACACTGA CACACACGTC 

+2 gGlySerThr GlyGly 
> 

1141 AGGGAGCACA GGAGGATAGC CGCATCACCA CCAGCAGCTC TTGCCCAGAG CTGTGCAGTG 
TCCCTCGTGT CCTCCTATCG GCGTAGTGGT GGTCGTCGAG AACGGGTCTC GACACGTCAC 

12 01 CAGTGGCTGA TTCTATTAGA GAACGTATGC GTTATCTCCA TCCTTAATCT CAGTTGTTTG 
GTCACCGACT AAGATAATCT CTTGCATACG CAATAGAGGT AGGAATTAGA GTCAACAAAC 

12 61 CTTCAAGGAC CTTTCATCTT CAGGATTTAC AGTGCATTCT GAAAGAGGAG ACATCAAACA 

GAAGTTCCTG GAAAGTAGAA GTCCTAAATG TCACGTAAGA CTTTCTCCTC TGTAGTTTGT 

13 21 GAATTAGGAG TTGTGCAACA GCTCTTTTGA GAGGAGGCCT AAAGGACAGG AGAAAAGGTC 

CTTAATCCTC AACACGTTGT CGAGAAAACT CTCCTCCCGA TTTCCTGTCC TCTTTTCCAG 

13 31 TTCAATCGTG GAAAGAAAAT TAAATGTTGT ATTAAATAGA TCACCAGCTA GTTTCAGAGT 
AAGTTAGCAC CTTTCTTTTA ATTTACAACA TAATTTATCT AGTGGTCGAT CAAAGTCTCA 

14*11 TACCATGTAC GTATTCCACT AGCTGGGTTC TGTATTTCAG TTCTTTCGAT ACGGCTTAGG 
ATGGTACATG CATAAGGTGA TCGACCCAAG ACATAAAGTC AAGAAAGCTA TGCCGAATCC 

1501 GTAATGTCAG TACAGGAAAA AAACTGTGCA AGTGAGCACC TCATTCCGTT GCCTTGCTTA 
CATTACAGTC ATGTCCTTTT TTTGACACGT TCACTCGTGG ACTAAGGCAA CGGAACGAAT 



1561 ACTCTAAAGC TCCATGTCCT GGGCCTAAAA 
TGAGATTTCG AGGTACAGGA CCCGGATTTT 

1621 TTTGCTCATA TTCACATATG TAAACCAGAA 
AAACGAGTAT AAGTGTATAC ATTTGGTCTT 

1681 AAAGGAACTA TGTTGCTATG AATTAAACTT 
TTTCCTTGAT ACAACGATAC TTAATTTGAA 

1741 TCATATTTCT TATTAAAATT TCTGCCATTT 
AGTATAAAGA ATAATTTTAA AGACGGTAAA 

1801 AGAGATAAAC CTGAAAAGAA GAGTGGCCTT 
TCTCTATTTG GACTTTTCTT CTCACCGGAA 

18 61 TGTTTCATTG TGTACATTTT TATATTCTCC 

ACAAAGTAAC ACATGTAAAA ATATAAGAGG 

1921 TCTTGTTAAA TATATCTATT TTTACCAAAG 
AGAACAATTT ATATAGATAA AAATGGTTTC 

19 81 GATCAACTAT TTTTAGCTTG GTAAATTTTT 

CTAGTTGATA AAAATCGAAC CATTTAAAAA 

2 041 AAGATGATAT AAAATATTGT TGCTCTGACA 
TTCTACTATA TTTTATAACA ACGAGACTGT 

2101 CTAGAGTTAG ATTAATCTGC ATTTTAAAAA 
GATCTCAATC TAATTAGACG TAAAATTTTT 

2161 AAGACTTTTT GAAAATAATT AAATTATCAT 
TTCTGAAAAA CTTTTATTAA TTTAATAGTA 

2221 TAAAAAGCAA CTTATGAAAG TAGACATTCA 
ATTTTTCGTT GAATACTTTC ATCTGTAAGT 

22 81 TGGGGAAATC TGAGCCTAGC TC AG AAAAAC 

ACCCCTTTAG ACTCGGATCG AGTCTTTTTG 

23 41 TCCTGATAAA GCGTGCTGTG CTGTGCAGTA 

AGGACTATTT CGCACGACAC GACACGTCAT 

2401 GTTTTATTAT CTTAAACTCT GTTCCATACA 
CAAAATAATA GAATTTGAGA CAAGGTATGT 

2 461 CAGAAGTATG TCTCT 
GTCTTCATAC AGAGA 




TCGTATAAAA TCTGGATTTT TTTTTTTTTT 
AGCATATTTT AGACCTAAAA AAAAAAAAAA 

CATTCTATGT ACTACAAACC TGGTTTTTAA 
GTAAGATACA TGATGTTTGG ACCAAAAATT 

GTGTCGTGCT GATAGGACAG ACTGGATTTT 
CACAGCACGA CTATCCTGTC TGACCTAAAA 

AGAAGAAGAG AACTACATTC ATGGTTTGGA 
TCTTCTTCTC TTGATGTAAG TACCAAACCT 

ATCTTCACTT TATCGATAAG CCAGTTTATT 
TAGAAGTGAA ATAGCTATTC GGTCAAATAA 

TTTTGACATT ATAACTGTTG GCTTTTCTAA 
AAAACTGTAA TATTGACAAC CGAAAAGATT 

GTATTTAATA TTCTTTTTTA TGACAACTTA 
CATAAATTAT AAGAAAAAAT ACTGTTGAAT 

CTAAACACAA TTGTTATAGC CAGAGGAACA 
GATTTGTGTT AACAATATCG GTCTCCTTGT 

AAAATACATG TATTTCATTC TCGTATGGTG 
TTTTATGTAC ATAAAGTAAG AGCATACCAC 

ACTGAATTGG AATAGAATTG GTAAGTTGCA 
TG AC TTAACC TTATCTTAAC CATTCAACGT 

ATCTTCCATT CCTGTTATTG GAGATGAAAA 
TAGAAGGTAA GGACAATAAC CTCTACTTTT 

GATCCAGCCA TTACTAACCT ATTCCTTTTT 
CTAGGTCGGT AATGATTGGA TAAGGAAAAA 

ATAAAGCACC TTGAAAAAGA CTTGGCAGCT 
TATTTCGTGG AACTTTTTCT GAACCGTCGA 

GGAACACATC CTATTTATTG TGATGTTGTG 
CCTTGTGTAG GATAAATAAC ACTACAACAC 

CT7GTATAAA TACATGGATA TTTTTATGTA 
GAACATATTT ATGTACCTAT AAAAATACAT 
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New Sequence + Incyte ESTs 

1 ATTTGTTTAA ACCTTGGGAA ACTGGTTCAG GTCCAGGTTT TGCTTTGATC CTTTTCAAAA 
TAAACAAATT TGGAACCCTT TGACCAAGTC CAGGTCCAAA ACGAAACTAG GAAAAGTTTT 

61 AC TG GAG AC A CAGAAGAGGG CTTCTAGGAA AAAGTTTTGG GATGGGATTA TGTGGAAACT 
TGACCTCTGT GTCTTCTCCC GAAGATCCTT TTTCAAAACC CTACCCTAAT ACACCTTTGA 

121 ACCCTGCGAT TCTCTGCTGC CAGAGCAGGC TCGGCGCTTC CACCCCAGTG CAGCCTTCCC 
TGGGACGCTA AGAGACGACG GTCTCGTCCG AGCCGCGAAG GTGGGGTCAC GTCGGAAGGG 

181 CTGGCGGTGG TGAAAGAGAC TCGGGAGTCG CTGCTTCCAA AGTGCCCGCC GTGAGTGAGC 
GACCGCCACC ACTTTCTCTG AGCCCTCAGC GACGAAGGTT TCACGGGCGG CACTCACTCG 

+2 Met SerLeuPhe GlyLeuLeu LeuLeuThrSer AlaLeuAl 

j 

2 41 TCTCACCCCA GTCAGCCAAA TGAGCCTCTT CGGGCTTCTC CTGCTGACAT CTGCCCTGGC 
AGAGTGGGGT CAGTCGGTTT ACTCGGAGAA GCCCGAAGAG GACGACTGTA GACGGGACCG 

+2 aGlyGlnArg GlnGlyThrGln AlaGluSer AsnLeuSer SerLysPheGln PheSerSe 



3 01 CGGCCAGAGA CAGGGGACTC AGGCGGAATC CAACCTGAGT AGTAAATTCC AGTTTTC C AG 
GCCGGTCTCT GTCCCCTGAG TCCGCCTTAG GTTGGACTCA TCATTTAAGG TCAAAAGGTC 

+2 rAsnLysGlu GlnTyrGlyVal GlnAspPro GlnHisGlu ArgllelleThr ValSerTh 



3 61 CAACAAGGAA CAGTACGGAG TACAAGATCC TCAGCATGAG AGAATTATTA CTGTGTCTAC 
GTTGTTCCTT GTCATGCCTC ATGTTCTAGG AGTCGTACTC TCTTAATAAT GACACAGATG 

+2 rAsnGlySer IleHisSerPro ArgPhePro HisThrTyr ProArgAsnThr ValLeuVa 



421 TAATGGAAGT ATTCACAGCC CAAGGTTTCC TCATACTTAT CCAAGAAATA CGGTCTTGGT 
ATTACCTTCA TAAGTGTCGG GTTCCAAAGG AGTATGAATA GGTTCTTTAT GCCAGAACCA 

+2 lTrpArgLeu ValAlaValGlu GluAsnVal TrpIleGln LeuThrPheAsp GluArgPh 



4 81 ATGGAGATTA GTAGCAGTAG AGGAAAATGT ATGGATACAA CTTACGTTTG ATGAAAGATT 
TACCTCTAAT CATCGTCATC TCCTTTTACA TACCTATGTT GAATGCAAAC TACTTTCTAA 

+2 eGlyLeuGlu AspProGluAsp AspIleCys LysTyrAsp PheValGluVal GluGluPr 



541 TGGGCTTGAA GACCCAGAAG ATGACATATG CAAGTATGAT TTTGTAGAAG TTGAGGAACC 
ACCCGAACTT CTGGGTCTTC TACTGTATAC GTTCATACTA AAACATCTTC AACTCCTTGG 

+ 2 oSerAspGly Thr IleLeuGly ArgTrpCys GlySerGly ThrValProGly LysGlnll 



601 CAGTGATGGA ACTATATTAG GGCGCTGGTG TGGTTCTGGT ACTGTACCAG GAAAACAGAT 
GTCACTACCT TGATATAATC CCGCGACCAC ACCAAGACCA TGACATGGTC CTTTTGTCTA 

+2 oSerLysGly A^nGlnl leArg XleArgPhe VulSorAsp GluTy r PhoPro SerGluPr 



661 TTCTAAAGGA AATCAAATTA GGATAAGATT TGTATCTGAT GAATATTTTC CTTCTGAACC 
AAGATTTCCT TTAGTTTAAT CCTATTCTAA ACATAGACTA CTTATAAAAG GAAGACTTGG 
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*2 oGlyPheCys IleHisTyrAsn IleValMec ProGlnPhe ThrGluAlaVal SerProSe 



721 AGGGTTCTGC ATCCACTACA ACATTGTCAT GCCACAATTC ACAGAAGCTG TGAGTCCTTC 
TCCCAAGACG TAGGTGATGT TGTAACAGTA CGGTGTTAAG TGTCTTCGAC ACTCAGGAAG 

+ 2 rValLeuPro ProSerAlaLeu ProLeuAsp LeuLeuAsn AsnAlalleThr AlaPheSe 



781 AGTGCTACCC CCTTCAGCTT TGCCACTGGA CCTGCTTAAT AATGCTATAA CTGCCTTTAG 
TCACGATGGG GGAAGTCGAA ACGGTGACCT GGACGAATTA TTACGATATT GACGGAAATC 

+2 rThrLeuGlu AspLeuIleArg TyrLeuGlu ProGluArg TrpGlnLeuAsp LeuGluAs 



841 TACCTTGGAA GACCTTATTC GATATCTTGA ACCAGAGAGA TGGCAGTTGG ACTTAGAAGA 
ATGGAACCTT CTGGAATAAG CTATAGAACT TGGTCTCTCT ACCGTCAACC TGAATCTTCT 

+2 pLeuTyrArg ProThrTrpGln LeuLeuGly LysAlaPhe ValPheGlyArg LysSerAr 



9 01 TCTATATAGG CCAACTTGGC AACTTCTTGG CAAGGCTTTT GTTTTTGGAA GAAAATCCAG 
AGATATATCC GGTTGAACCG TTGAAGAACC GTTCCGAAAA CAAAAACCTT CTTTTAGGTC 

+2 gValValAsp LeuAsnLeuLeu ThrGluGlu ValArgLeu TyrSerCysThr ProArgAs 



9 61 AGTGGTGGAT CTGAACCTTC TAACAGAGGA GGTAAGATTA TACAGCTGCA CACCTCGTAA 
TCACCACCTA GACTTGGAAG ATTGTCTCCT CCATTCTAAT ATGTCGACGT GTGGAGCATT 

+2 nPheSerVal SerlleArgGlu GluLeuLys ArgThrAsp ThrllePheTrp ProGlyCy 



1021 CTTCTCAGTG TCCATAAGGG AAGAACTAAA GAGAACCGAT ACCATTTTCT GGCCAGGTTG 
GAAGAGTCAC AGGTATTCCC TTCTTGATTT CTCTTGGCTA TGGTAAAAGA CCGGTCCAAC 

+2 sLeuLeuVal LysArgCysGly GlyAsnCys AlaCysCys LeuHisAsnCys AsnGluCy 



1081 TCTCCTGGTT AAACGCTGTG GTGGGAACTG TGCCTGTTGT CTCCACAATT GCAATGAATG 
AGAGGACCAA TTTGCGACAC CACCCTTGAC ACGGACAACA GAGGTGTTAA CGTTACTTAC 

+2 sGlnCysVal ProSerLysVal ThrLysLys TyrHisGlu ValLeuGlnCeu ArgProLy 



1141 TCAATGTGTC CCAAGCAAAG TTACTAAAAA ATACCACGAG GTCCTTCAGT TGAGACCAAA 

AGTTACACAG GGTTCGTTTC AATGATTTTT TATGGTGCTC CAGGAAGTCA ACTCTGGTTT 

+2 sThrGlyVal ArgGlyLeuHis LysSerLeu ThrAspVal AlaLeuGluHis HisGluGl 

1201 GACCGGTGTC AGGGGATTGC ACAAATCACT CACCGACGTG GCCCTGGAGC ACCATGAGGA 

CTGGCCACAG TCCCCTAACG TGTTTAGTGA GTGGCTGCAC CCGGACCTCG TGGTACTCCT 

+2 uCysAspCys ValCysArgGly SerThrGly Gly 



12 51 GTGTGACTGT GTGTGCAGAG GGAGCACAGG AGGATAGCCG CATCACCACC AGCAGCTCTT 

CACACTGACA CACACGTCTC CCTCGTGTCC TCCTATCGGC GTAGTGGTGG TCGTCGAGAA 

13 21 GCCCAGAGCT GTGCAGTGCA GTGGCTGATT CTATTAGAGA ACGTATGCGT TATCTCCATC 

CGGGTCTCGA CACGTCACGT CACCGACTAA GATAATCTCT TGCATACGCA ATAGAGGTAG 

13 31 CTTAATCTCA GTTGTTTGCT TCAAGGACCT TTCATCTTCA GGATTTACAG TGCATTCTGA 
GAATTAGAGT CAACAAACGA AGTTCCTGGA AAGTAGAAGT CCTAAATGTC ACGTAAGACT 




1441 AAGAGGAGAC ATCAAACAGA ATTAGGAGTT GTGCAACAGC TCTTTTGAGA GGAGGCCTAA 

TTCTCCTCTG TAGTTTGTCT TAATCCTCAA CACGTTGTCG AGAAAACTCT CCTCCGGATT 

1501 AGGACAGGAG AAAAGGTCTT CAATCGTGGA AAGAAAATTA AATGTTGTAT TAAATAGATC 

TCCTGTCCTC TTTTCCAGAA GTTAGCACCT TTCTTTTAAT TTACAACATA ATTTATCTAG 

1561 ACCAGCTAGT TTCAGAGTTA CCATGTACGT ATTCCACTAG CTGGGTTCTG TATTTCAGTT 

TGGTCGATCA AAGTCTCAAT GGTACATGCA TAAGGTGATC GACCCAAGAC ATAAAGTCAA 

1621 CTTTCGATAC GGCTTAGGGT AATGTCAGTA CAGGAAAAAA ACTGTGCAAG TGAGCACCTG 

GAAAGCTATG CCGAATCCCA TTACAGTCAT GTCCTTTTTT TGACACGTTC ACTCGTGGAC 

1681 ATTCCGTTGC CTTGGCTTAA CTCTAAAGCT CCATGTCCTG GGCCTAAAAT CGTATAAAAT 

TAAGGCAACG GAACCGAATT GAGATTTCGA GGTACAGGAC CCGGATTTTA GCATATTTTA 

1741 CTGGATTTTT TTTTTTTTTT TTGCGCATAT TCACATATGT AAACCAGAAC ATTCTATGTA 

GACCTAAAAA AAAAAAAAAA AACGCGTATA AGTGTATACA /TTTGGTCTTG TAAGATACAT 

1801 CTACAAACCT GGTTTTTAAA AAGGAACTAT GTTGCTATGA ATTAAACTTG TGTCATGCTG 

GATGTTTGGA CCAAAAATTT TTCCTTGATA CAACGATACT TAATTTGAAC ACAGTACGAC 

1861 ATAGGACAGA CTGGATTTTT CATATTTCTT ATTAAAATTT CTGCCATTTA GAAGAAGAGA 

TATCCTGTCT GACCTAAAAA GTATAAAGAA TAATTTTAAA GACGGTAAAT CTTCTTCTCT 

19 21 ACTACATTCA TGGTTTGGAA G AG AT AAAC C TGAAAAGAAG AGTGGCCTTA TCTTCACTTT 

TGATGTAAGT ACCAAACCTT CTCTATTTGG ACTTTTCTTC TCACCGGAAT AGAAGTGAAA 

19 81 ATCGATAAGT CAGTTTATTT GTTTCATTGT GTACATTTTT ATATTCTCCT TTTGACATTA 

TAGCTATTCA GTCAAATAAA CAAAGTAACA CATGTAAAAA TATAAGAGGA AAACTGTAAT 

2041 TAACTGTTGG CTTTTCTAAT CTTGTTAAAT ATATCTATTT TTACCAAAGG TATTTAATAT 

ATTGACAACC GAAAAGATTA GAACAATTTA TATAGATAAA AATGGTTTCC ATAAATTATA 

2101 TCTTTTTTAT GACAACTTAG ATCAACTATT TTTAGCTTGG TAAATTTTTC TAAACACAAT 

AGAAAAAATA CTGTTGAATC TAGTTGATAA AAATCGAACC ATTTAAAAAG ATTTGTGTTA 

2161 TGTTATAGCC AG AGG AAC AA AGATGATATA AAATATTGTT GCTCTGACAA AAATACATGT 

ACAATATCGG TCTCCTTGTT TC TACT AT AT TTTATAACAA CGAGACTGTT TTTATGTACA 

2 221 ATTTCATTCT CGTATGGTGC TAGAGTTAGA TTAATCTGCA TT TT AAAAAA CTGAATTGGA 

TAAAGTAAGA GCATACCACG ATCTCAATCT AATTAGACGT AAAATTTTTT GACTTAACCT 

22 81 ATAGAATTGG TAAGTTGCAA AGACTTTTTG AAAATAATTA AATTATCATA TCTTCCATTC 

TATCTTAACC ATTCAACGTT TCTGAAAAAC TTTTATTAAT TTAATAGTAT AGAAGGTAAG 

23 41 CTGTTATTGG AGATGAAAAT AAAAAGCAAC TTATGAAAGT AGACATTCAG ATCCAGCCAT 

GACAATAACC TCTACTTTTA TTTTTCGTTG AATACTTTCA TCTGTAAGTC TAGGTCGGTA 

2401 TACTAACCTA TTCCTTTTTT GGGGAAATCT GAGCCTAGCT CAGAAAAACA TAAAGCACCT 

ATGATTGGAT AAG G AAAAAA CCCCTTTAGA CTCGGATCGA GTCTTTTTGT ATTTCGTGGA 

24 61 TGAAAAAGAC TTGGCAGCTT CCTGATAAAG CGTGCTGTGC TGTGCAGTAG GAACACATCC 

ACTTTTTCTG AACCGTCGAA GGACTATTTC GCACGACACG ACACGTCATC CTTGTGTAGG 

2 521 TATTTATTGT GATGTTGTGG TTTTATTATC TTAAACTCTG TTCCATACAC TTGTATAAAT 

ATAAATAACA CTACAACACC AAAATAATAG AATTTGAGAC AAGGTATGTG AACATATTTA 
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2581 ACATGGATAT TTTTATGTAC AGAAGTATGT CTCTTAACCA GTTCACTTAT TGTACTCTGG 

TGTACCTATA AAAATACATG TCTTCATACA GAGAATTGGT CAAGTGAATA ACATGAGACC 

2641 CAATTTAAAA GAAAATCAGT AAAATATTTT GCTTGTAAAA TGCTTAATAT CGTGCCTAGG 

GTTAAATTTT CTTTTAGTCA TTTTATAAAA CGAACATTTT ACGAATTATA GCACGGATCC 

2701 TTATGTGGTG ACTATTTGAA TCAAAAATGT ATTGAATCAT CAAATAAAAG AATGTGGCTA 

AATACACCAC TGATAAACTT AGTTTTTACA TAACTTAGTA GTTTATTTTC TTACACCGAT 

2761 TTTTGGGGAG AAAATT 
AAAACCCCTC TTTTAA 




Additional oligonucleotides used for amplification of entire 
coding region 



5'-1 



T7TGTTTAAACCTTGGGAAACTGG 



5'-2 



GTCCAGGTTTTGCTTTGATCC 
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DNA Sequence Of Clones 4 & 7, Identical Clones Containing The 
Entire Open Reading Frame 

1 TTTGTTTAAA CCTTGGGAAA CTGGTTCAGG TCCACGTTTT GCTTTGATCC TTTTCAAAAA 
AAACAAATTT GGAACCCTTT GACCAAGTCC AGGTCCAAAA CGAAACTAGG AAAAGTTTTT 

61 CTGGAGACAC AGAAGAGGGC TCTAGGAAAA AGTTTTGGAT GGG ATTATGT GGAAACTACC 
GACCTCTGTG TCTTCTCCCG AGATCCTTTT TCAAAACCTA CCCTAATACA CCTTTGATGG 

121 CTGCGATTCT CTGCTGCCAG AGCAGGCTCG GCGCTTCCAC CCCAGTGCAG CCTTCCCCTG 
GACGCTAAGA GACGACGGTC TCGTCCGAGC CGCGAAGGTG GGGTCACGTC GGAAGGGGAC 

181 GCGGTGGTGA AAGAGACTCG GGAGTCGCTG CTTCCAAAGT GCCCGCCGTG AGTGAGCTCT 
CGCCACCACT TTCTCTGAGC CCTCAGCGAC GAAGGTTTCA CGGGCGGCAC TCACTCGAGA 

+2 MetSer LeuPheGly LeuLeuLeu LeuThrSerAla LeuAlaGl 

] 

2 41 CACCCCAGTC AGCCAAATGA GCCTCTTCGG GCTTCTCCTG CTGACATCTG CCCTGGCCGG 
GTGGGGTCAG TCGGTTTACT CGGAGAAGCC CGAAGAGGAC GACTGTAGAC GGGACCGGCC 

+2 yGlnArgGln GlyThrGlnAla GluSerAsn LeuSerSer LysPheGlnPhe SerSerAs 



3 01 CC AG AG AC AG GGGACTCAGG CGGAATCCAA CCTGAGTAGT AAATTCCAGT TTTCCAGCAA 
GGTCTCTGTC CCCTGAGTCC GCCTTAGGTT GGACTCATCA TTTAAGGTCA AAAGGTCGTT 

+2 nLysGluGln AsnGlyValGln AspProGln HisGluArg IlelleThrVal SerThrAs 



3 61 CAAGGAACAG AACGGAGTAC AAGATCCTCA GCATGAGAGA ATTATTACTG TGTCTACTAA 
GTTCCTTGTC TTGCCTCATG TTCTAGGAGT CGTACTCTCT T AAT AATG AC AC AG ATG AT T 

+2 nGlySerlle HisSerProArg PheProHis ThrTyrPro ArgAsnThrVal .LeuValTr 



421 TGGAAGTATT CACAGCCCAA GGTTTCCTCA TACTTATCCA AGAAATACGG TCTTGGTATG 
ACCTTCATAA GTGTCGGGTT CCAAAGGAGT ATGAATAGGT TCTTTATGCC AGAACCATAC 

+2 pArgLeuVal AlaValGluGlu AsnValTrp IleGlnLeu ThrPheAspGlu ArgPheGl 



431 GAGATTAGTA GCAGTAGAGG AAAATGTATG GATACAACTT ACGTTTGATG AAAGATTTGG 
CTCTAATCAT CGTCATCTCC TTTTACATAC CTATGTTGAA TGCAAACTAC TTTCTAAACC 

+2 yLeuGluAsp ProGluAspAsp IleCysLys TyrAspPhe ValGluValGlu GluProSe 



541 GCTTGAAGAC CCAGAAGATG ACATATGCAA GTATGATTTT GTAGAAGTTG AGGAACCCAG 
CGAACTTCTG GGTCTTCTAC TGTATACGTT CATACTAAAA CATCTTCAAC TCCTTGGGTC 

+2 rAspGlyThr IleLeuGlyArg TrpCysGly SerGlyThr ValProGlyLys GlnlleSe 



601 TGATGGAACT ATATTAGGGC GCTGGTGTGG TTCTGGTACT GTACCAGGAA AACAGATTTC 
ACTACCTTGA TATAATCCCG CGACCACACC AAGACCATGA CATGGTCCTT TTGTCTAAAG 

+2 rLysGlyAsn GlnlleArglle ArgPheVal SerAspGlu TyrPheProSer GluProGl 



661 TAAAGGAAAT CAAATTAGGA TAAGATTTGT ATCTGATGAA TATTTTCCTT CTGAACCAGG 
ATTTCCTTTA GTTTAATCCT ATTCTAAACA TAGACTACTT ATAAAAGGAA GACTTCGTCC 
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+2 yPheCysIle HisTyrAsnlle ValMetPro GInPheThr GluAlaValSer ProSerVa 



721 GTTCTGCATC CACTACAACA TTGTCATGCC ACAATTCACA GAAGCTGTGA GTCCTTCAGT 
CAAGACGTAG GTGATGTTGT AACAGTACGG TGTTAAGTGT CTTCGACACT CAGGAAGTCA 

+2 lLeuProPro SerAlaLeuPro LeuAspLeu LeuAsnAsn AlalleThrAla PheSerTh 



781 GCTACCCCCT TCAGCTTTGC CACTGGACCT GCTTAATAAT GCTATAACTG CCTTTAGTAC 
CGATGGGGGA AGTCGAAACG GTGACCTGGA CGAATTATTA CGATATTGAC GGAAATCATG 

+2 rLeuGluAsp LeuIleArgTyr LeuGluPro GluArgTrp GlnLeuAspLeu GluAspLe 



8 41 CTTGGAAGAC CTTATTCGAT ATCTTGAACC AGAGAGATGG CAGTTGGACT TAGAAGATCT 
GAACCTTCTG GAATAAGCTA TAGAACTTGG TCTCTCTACC GTCAACCTGA ATCTTCTAGA 

+2 uTyrArgPro ThrTrpGlnLeu LeuGlyLys AlaPheVal PheGlyArgLys SerArgVa 



9 01 ATATAGGCCA AC TTGGC AAC TTCTTGGCAA GGCTTTTGTT TTTGGAAGAA AATCCAGAGT 
TATATCCGGT TGAACCGTTG AAGAACCGTT CCGAAAACAA AAACCTTCTT TTAGGTCTCA 

+2 lValAspLeu AsnLeuLeuThr GluGluVal ArgLeuTyr SerCysThrPro ArgAsnPh 



961 GGTGGATCTG AACCTTCTAA CAGAGGAGGT AAGATTATAC AGCTGCACAC CTCGTAACTT 
CCACCTAGAC TTGGAAGATT GTCTCCTCCA TTCTAATATG TCGACGTGTG GAGCATTGAA 

+2 eSerValSer IleArgGluGlu LeuLysArg ThrAspThr IlePheTrpPro GlyCysLe 



10 21 CTCAGTGTCC ATAAGGGAAG AACTAAAGAG AACCGATACC ATTTTCTGGC CAGGTTGTCT 
GAGTCACAGG TATTCCCTTC TTGATTTCTC TTGGCTATGG TAAAAGACCG GTCCAACAGA 

+2 uLeuValLys ArgCysGlyGly AsnCysAla CysCysLeu His AsnCysAsn GluCysGl 



10 81 CCTGGTTAAA CGCTGTGGTG GGAACTGTGC CTGTTGTCTC CACAATTGCA ATGAATGTCA 
GGACCAATTT GCGACACCAC CCTTGACACG GACAACAGAG GTGTTAACGT TACTTACAGT 

+2 nCysValPro SerLysValThr LysLysTyr HisGluVal LeuGlnLeuArg ProLysTh 



1141 ATGTGTCCCA AGCAAAGTTA CTAAAAAATA CCACGAGGTC CTTCAGTTGA GACCAAAGAC 
TACACAGGGT TCGTTTCAAT GATTTTTTAT GGTGCTCCAG GAAGTCAACT CTGGTTTCTG 

+2 rGlyValArg GlyLeuHisLys SerLeuThr AspValAla LeuGluHisHis GluGluCy 



12 01 CGGTGTCAGG GGATTGCACA AATCACTCAC CGACGTGGCC CTGGAGCACC ATGAGGAGTG 
GCCACAGTCC CCTAACGTGT TTAGTGAGTG GCTGCACCGG GACCTCGTGG TACTCCTCAC 

+2 sAspCysVal CysArgGlySer ThrGlyGly 



1261 TGACTGTGTG TGCAGAGGGA GCACAGGAGG ATAGCCGCAT CACCACCAGC AGCTCTTGCC 

ACTGACACAC ACGTCTCCCT CGTGTCCTCC TATCGGCGTA GTGGTGGTCG TCGAGAACGG 

13 21 CAGAGCTGTG CAGTGCAGTG GCTGATTCTA TTAGAGAACG TATGCGTTAT CTCCATCCTT 

GTCTCGACAC GTCACGTCAC CGACTAAGAT AATCTCTTGC ATACGCAATA GAGGTAGGAA 

13 81 AATCTCAGTT GTTTGCTTCA AGGACCTTTC ATCTTCAGGA TTTACAGTGC ATTCTGAAAG 

TTAGAGTCAA CAAACGAAGT TCCTGGAAAG TAGAAGTCCT AAATGTCACG TAAGACTTTC 

1441 AGGAGACATC AAACAGAATT AGGAGTTGTG CAA 

TCCTCTGTAG TTTGTCTTAA TCCTCAACAC GTT 
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Predicted Full-length Polypeptide Sequence 



1 MSLFGLLCXT SALAGQRQGT QAESNLSSXF QFSSNXEQYG VQDPQHERII 

51 TVSTNGSIHS PRFPHTYPRN TVLVWRLVAV EENVWIQLTF DERFGLEDPE 

101 DDICKYT5FVE VEEPSDGTIL GRWCGSGTVP GKQISXGNQI RIRFVSDEYr 

151 PSEPGFCIKY NIVMPQFTEA VSPSVLPPSA LPLDLLNNAI TAFSTLEDLI 

201 RYLEPERWQL DLEDLYRPTW QLLGKAFVFG RXSRWDLNL LTEEVRLYSC 

251 TPRNFSVSIR EELXRTDTIF WPGCLLVXRC GGNCACCLKN CNECQCVPSX 

3 01 VTKXYHEVLQ LRPXTGVRGL HX3LTDVALE HHSECDCVCR GSTGG 
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Alignment of VEGF-X with Other VEGFs 



VEGF_HUMAN 
PLGFJiUMAN 
VEGB_HUMAN 
VEGC_HUMAN 
VEGD_HUMAN 
9 90126vegx 



20 



40 



MSLFGLLLLTSALAGQRQGTQAESNLSSKFQFSSNKEQNGVQDPQHERII 



50 



VEGF__ HUMAN 
PLGF_HUMAN 
VEGB_HUMAN 
VEGC__HUMAN 
VEG D — HUMAN 
990126vegx 



VEGF^HUMAN 
PLGF__HUMAN 
VEGB_HUMAN 
VEGC_HUMAN 
VEGD„HUMAN 
990126vegx 



60 * 80 * 100 

T VSTNG S I H S PRFPHTYPRNTVLVWRLVAVEENVWI QLTFDERFGLEDPE : 100 

120 * 140 * 

MHLLGFFSVACSLLAAALLPGPREAPAAAA : 3 0 

MYREWVVVNV : 10 

DDICKYDFVEV--EEPSDGTILGRWCGSGTVPGKQISKGNQIRIRFVSDE : 148 



VEGF_HUMAN 
PLGF_HUMAN 
VEGB_HUMAN 
VEGC — HUMAN 
VEGD„HUMAN 
990126vegx 



160 * 180 * 200 

m . 2 

MP . 2 

AFESGLDLSDAEPDAGEATAYASKDLEEQLRSVSSVDELMTVLYPEYWKM : 80 

FMMLYVQLVQGSSNEHGPVKRSSQSTLERSEQQIRAASSLEELLRITHSE : 60 

YFPSEPGFCIHYNIVMPQFTEAVSPSVLPPSALPLDLLNNAITAFSTLED : 198 



* 220 * 240 * 

VEGF_HUMAN : FLLSWVHWSLALLLYLHHAKWSQAAPMAEGGGQNHHEWKFMD- VYQ0SY : 51 

PLGF_HUMAN : VMRLF PC F LQ LLAG L ALP A VP PQQWAL SAGNG S S EVEWPFQE - WJgSs Y : 51 

VEGB_HUMAN : MSPLLRRLLLAALLQLAPAQAPVSQPDAPGHQRKWSWID-VYTraAT : 4 6 

VEGC_HUMAN : YKCQLRKGGWQHNREQANLNSRTEETIKFAAAHYNTEILKSIDNEWrHtQ : 13 0 

VEGD_KUMAN : D^LWRCRLRLKSFTSMDSRSASHRSTRFAATFYDIETLKVIDEEWQgTQ : no 

99012 6vegx : LIRYLEPERWQLDLEDLYRPTWQLLGKAFVFGRKSRWDLNLLTEEvSlY : 248 



VEGF_HUMAN 
PLGF_HUMAN 
VEGB_HUMAN 
VEGC_HUMAN 
VEG D_ HUMAN 
990125vegx 



260 

chpietlvdifqHypdeieyif: 

CRALERLVDVVS ijYPSEVEHMFS 

cq pre vwpltv glmgtvakql' 
cmprevcidvgkSfgvatntff: 
cs pretcvevas glgkstntff". 
sctprnfsvsir@elkrtdtifwhi 



280 * 300 

SgS — OTnd--egleBSp 
ITS — jjgd--enlhBBp 
-jjpd--dgle8Hp 

-J «NS--EGLQfflgN 

- j ■ne~-esli858n 

jcaBB)lhncnecqB8Bp 
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VEGF_HUMAN 
PLGF_HUMAN 
VEGB__HUMAN 
VEGC_HUMAN 
VEGD_HUMAN 
990126vegx 



320 



!lkm 

LMU 



TEESNITMi 
VETANVTM< 
TGQHQVRM 
TSTSYLSK 1 
TSTSYISK' 
SKVTKKYHEffiLQ 



IKPHQG- 
RSGDR- 
JRYPS-- 



-QHIGESSFLQ 
-PSYVEWTFSQ 



— sqlgewslee 

feqtvplsqg pkpvthsf. 

fe i. svpltsv pelvpwkv, 

IrpktgvrglhkslteJJale 




340 * 

EgRPKKDRARQEK : 141 

•^RPLREKMKPER : 141 

RPKKKDSAVKP : 13 5 

SKLDVYRQVH : 222 

KfflLPTAPRHPYSI : 2 02 

DgVCRGSTGG : 3 45 



VEGF_HUMAN 
PLGF_HUMAN 
VEGB_HUMAN 
VEGC„HUMAN 
VEGD_HUMAN 
990126vegx 



360 * 380 
KSVRGKGKGQKRKRKKSRYKSWSVP 



400 



DSPR 

SIIRRSLPATLPQCQAANKTCPTMYT4WNNHICRCLAQEDFMFSSDAGDDS 
IRRSIQIPEEDRCSHSKKLCPIDMLWDSNKCKCVLQEENPLAGT 



166 

139 
272 
246 



VEGF„HUMAN 
PLGF_HUMAN 
VEGB_KUMAN 
VEG C — HUMAN 
VEGD_HUMAN 
9 9012 6vegx 



420 



440 



TDGFKDICGPNKELDEETCQCVC RAGLRPASCGPHKELDRNSCQCVCKNK 
EDHSHLQEPALCGP 



322 
260 



VEGF_HUMAN 
PLG F_HUMAN 
VEGB_HUMAN 
VEGC„HUMAN 
VEG D_HUMAN 
990126vegx 



460 * 480 * 500 

CGPCSERRKHLFVQDPQTCKC- SCKNTDSRCKARQLELNER : 2 0 6 

CGDAVPRR : 149 

PLCPRCTQHHQRPDPRTCRCRCRRRSFLRCQGRGLELNPD : 17 9 

LFPSQCGANREFDENTCQCVCKRTCPRNQPLNPGKCACECTESPQKCLLK : 3 7 2 

HMMFDEDRCECVCKTPCPKDLIQHPKNCSCFECKESLETCCQKHKLFHPD : 310 



VEGF_HUMAN 
PLGF_HUMAN 
VEGB — HUMAN 
VEGC„HUMAN 
VEGD__HUMAN 
990126vegx 



520 



540 



TCRCDKPRR- 



TCRCRKLRR 

GKKFHHQTCSCYRRPCTNRQKACEPGFSYSEEYCRCVPSYWKRPQMS- 
TCSCEDRCPFHTRPCASGKTACAKHCRFPKEKRAAQGPHSRKNP 



215 

1S8 
419 
354 
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FL_seq 

clone41 

clone20 



Variant Polypeptide Sequences 



FL__seq 

clone41 

clone20 



\ 

3 



20 



40 



ESLFGLLLLTSAIxAGQRQGTQAESNLSSKFQFSSNKEQNGVQDPQHERII 
[SLFGLLLLTSALAGQRQGTQAESNLSSKFQFSSNKEQNGVQDPQHERII 
[SLFGLLLLTSALAGQRQGTQAESNLSSKFQFSSNKEQNGVQDFQHERII 



TVSTNGSIHSPRFPHTYPRlsriVtiViroLVAVEEISIVt^IQLTFDERFGLEDPE 
TVSTNG S IHS PRFPHTYPRNTVLVWRLVAVEENVWIQLTFDERFGLEDPE 
WSTNGSIHSPRFPHTYPRNTVLVWRLVAVEENVWIQLTFDERFGLEDPE 



DDICKYDFVEVEEPSDGTILGRWCGSGTVPGKQISKGNQIRIRFVSDEYF 
DDICKYDFVEVEEPSDGTILGRWCGSGTVPGKQISKGNQIRIRFVSDEYF 
DDICKYDFVEVEEPSDGTILGRWCGSGTVPGKQISKGNQIRIRFVSDEYF 



FL_seq 

clone41 

clone20 



FL_ seq 

clone41 

clone20 



PSEP 
PSEP 
PSEP 



160 * 180 * 200 

GFCIHYNnvffiPQFTEAVSPSVLPPSALPLDLLNNAITAFSTLEDLI : 20 0 

SNRGGKIRQfflHTS : 167 

GFCIHYNUVjSpQFTEAVSPSVLPPSALPLDLLNNAITAFSTLEDLI : 20 0 



FL_seq 

clone41 

clone20 



220 * 240 

RYLEPERWQLDLEDLYRPTWQLLGKAFVFGRKSRWDLNLLTEEVRLYSC : 2 5 0 

RYLEPERWQLDLEDLYRP™QLLGKAFVFGRKSRWDLNLLTE : 243 



FL_seq 

clone41 

clone20 



260 * 280 * 300 

TPRNFSVSIREELKRTDTIFWPGCLLVKRCGGNCACCLHNCNECQCVPSK 



300 



FL_seq 
clone41 
clone2 0 



* 320 * 340 

VTKKYHEVLQLRPKTGVRGLHKSLTDVALEHHEECDCVCRGSTGG 



EVLQLRPKTGVRGLHKSLTDVALEHHEECDCVCRGSTGG 



345 
282 
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Primers for Expression of VEGF-X 

E.coli expression of domain- 

vegx-6 AATTQG ATCC GAG AGTG GTG G ATCTG AACC 

vegx-7 AATTGGATCCGGGAAGAAAATCCAGAGTGG 

vegx-8 GGTTGAATTCATTATTTTTTAGTAACTTTGCTTGGGACAC 

vegX-9 AATTGAATTCATTATCCTCCTGTGCTCCCTC 

Baculovirus/insect cell expression of full-length protein- 
vegbad 

AATTGGATCCGGAGTCTCACCATCACCACCATCATGAATCCAACCTGAGTAGTAAATTC 
C 

vegbac2 AATTGAATTCGCTATCCTCCTGTGCTCCCTCTGC 
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>3993180H1 LUNGNONO 3 INCYTE 

CACAAATCACTCACCGACGTGGCCCTGGAGCACCATGAGGNGTGTGACTGTGTGTGCAGAGGGAGCACAGGAGGATAGCC 

GCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATC7CCA7 

CCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCATCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGACATCAAACAG 

AATTAGGAGTTGTGCAACAGCTCTTTTGAGAGGAGGCTAAAGGACAGGAGAANAGGTCTT 

>3510192H1 CONCNOT01 INCYTE 

TGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGTTGTTTGCTTCAAGGACCTT 

TCATCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGACATCAAACAGAATTAGGAGTTGTGCAACAGCTCTTTTGAGAG 

GAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTGGAAAGAAAATTAAATGTTGTATTAAATAGATCACCAGCTAGTT 

TCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTT 

>25 5 9 870H1 ADRETUT01 INCYTE 

CACGAGGTCCTTCAGTTGAGACCAAAGACCGGTGTCAGGGGATTGCACAAATCACTCACCGACGTGGCCCTGGAGCACCA 

TGAGGAGTGTGACTGTGTGTGCAGAGGGAGCACAGGGGGATAGCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGC 

AGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCA 

TCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGA 

>3979767H1 LUNGTUTO 8 INCYTE 

GGAGGATAGCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGC 

GTTATCTCCATCCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCATCTTCAGGATTTACAGTGCATTCTGAAAGAGGAG 

ACATCAAACAGAATTAGGAGTTGTGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTG 

GAAAGAANATTAAATGTTGTATTAAATAGACACCAGCT 

>3 98 0011H1 LUNGTUTO 8 INCYTE 

GGAGGATAGCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTA.GAGAACGTATGC 

GTTATCTCCATCCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCATCTTCAGGATTTACATGCATTCTGAAAGAGGAGA 

CATCAAACAGAATTAGGAGTTGTGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTGG 

AAAGAAAATTAAATGTTGTATTAAATAGATCACCA 

>4825396H1 BLADDIT01 INCYTE 

GAGAACCGATACCATTTTCTGGCCAGGTTGTCTCCTGGTTAAACGCTGTGGTGGGAACTGTGCCTGTTGTCTCCACAAT" 
GCAATGAATGTCAATGTGTCCCAAGCAAJ\GTTACTAAAAAATACCACGAGGTCCTTCAGTTGAGACCAAAGACCGGTGTC 
AGGGGATTGC AC AAATCACTCACCGACGTGGCCCTGGAGCACC ATGAGGAGTG'TG ACTGTGTGTGC AGAGGGAGC AC A. GG 
AGGATAGCCGCATCACCACCA 

>3073703H1 BONEUNT01 INCYTE 

AGAAAATCCAGAGTGGTGGATCTGAACCTTCTAACAGAGGAGGTAAGA7TATACAGCTGCACACCTCGTAACTTCTCAGT 
GTCCATAAGGGAAGAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTCTCCTGGTTAAACGCTGTGGTGGGAAC7 
GTGCCTGTTGTCTCCACAATTGCAATGAATGTCAATGTGTCCCAAGCAAJ\GTTAC7.«AAAATA.CCACGAGGTCCTTCAG 
TTG AG AC C AAAG AC CGGTGTC AGGGGATTGC AC AAATC A 
>1302516H1 PL AC NOT 02 INCYTE 

AGGAAATCAAATTAGGATAAGATTTGTATCTGATGAATATTTTCCTTCTGAA.CCTTCTAACAGAGGAGGTAAGA7TATAC 
AGCTGCACACCTCGTAACTTCTCAGTGTCCATAAGGGAAGAACTAAAGAGAACCGA7ACCATTTTCTGGCCAGGTTGTC7 
CCTGGTTAAACGCTGTGGTGGGAACTGTGCCTGTTGTCTCCCACAA77GCA.\TGA.-.7GTCAATGTGTCCC.AA 1 GCA^- :i .G77 
ACTAAAAAATACCACGAGGTCC 

>3684109H1 HEAANOT01 INCYTE 

AT TT C AT CTTCAGGATTTACAGTGCATTCTG AAAN AG G A G AAA 7 C AAA 7 AN AA 7 7 A 3 G A 3 7 T G 7 G C AA C A G C 7 C 77 7 7 7 A 

GAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTGGAAANAAAATTAA^TGTTGTATTAAATAGATCACCAGCTA 

GTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACGGCTTAGGGTAATGTCAG 

TACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTT 

>4713188H1 BRAIKCT01 INCYTE 

CAAAGTT ACT AAAAAATACCACGAGGTCCTTC AG TTG AG ACC AAAG ACCGGTGTC AGGGGATTGC AC AAATC ACTC AC CG 
ACGTGGCCCTGGAGCACCATGAGGAGTGTGACTGTGTGTGC AGAGGGAGC AC AGGAGG AT AGCCGCATCACCACCAGC AG 
CTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACG7A7GCG77A7CTCCA7CCTT.AATCTCAGTTG7 
TTGCT 

>458823H1 KERANOT01 INCYTE 

ANGAGTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTA7TAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGT7 

GTTTGNTTCAAGGACCTTTCATCTTCAGGA.TTTACAGTGCATTCTGAAAGAGGAGACATCAA-ACAGAATTA.GGAGTTGTG 

CAACAGCTCTTTTGAGAGGAGGCCTAAAGGNCAGGAGAAAJVGGTCTTCAATCGTGG^ 

ATAGATC 

>1303909H1 P LAC NOT 02 INCYTE 

AGGA.AATCAAATTAGGATA.2iGATTTGTATCTGATGAATA77TTCCTTC7GA-\CCT7CT^CAGACGAGGTA.^GATTATA: 
AGCTGCACACCTCGTAACTTCTCAGTGTCCATAAGGGAAGAACTAAAGAGAACCGA7ACCATTTTCTGGCCAGGTTGTC7 
CCTGGTTAAACGCTGTGGTGGGAACTGTGCCTGTTGTCTCCACAATTGCAATGAA7GTCAATG7GTCCCAAG 
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>2739211H1 OVAHNOT09 INCYTE 

GTGCATTCTGAAAGAGGAGACATCAAACAGAATTAGGAGTTGTGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGA 
GAAAAGGTCTTCAATCGTGGAAAGAAAATTAAATGTTGTATTAAATAGATCACCAGCTAGTTTCAGAGTTACCATGTACG 
TATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACGGCTTAGGGTAATGTCAGTACAGGAAAAAAACTGTGCAA 
GTGAGCACCTGAT 

>3325591H1 PTHYNOT03 INCYTE 

TGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTGGAAAGAAAATTAAATGTTGTATT 

AAATAGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACG 

GCTTAGGGTAATGTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACCCTAAAGCNCC 

ATGTC NNGGGC N AAAANC G AAAAAT 

>3733565H1 SMCCNOS01 INCYTE 

CCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCATCTTCAGGATTTACAGTGCATTCTGNAAGANGAGACATCAAACAG 
AATTAGGNGTTGTGCAAAAGCTCTTTTGAGAGGAGGCCTAA^GGACAGGAGAAAAGGTCTM^ 

AAATGTTGTATNAAATNGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGNCNGTATTCAGTCT 

TTCGGAACGGCTTAGGGTAATGTCAGTACAGGANAAAAACTGTGCAGTGAG 

>3554223H1 SYNONOT01 INCYTE 

ATTAAATAGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGAT 

ACGGCTTAGGGTAATGTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGGCTTAACTCTAA^.G 

CTCCATGTCCTGGGCCTAAAATCGTATAAAATCTGGATTTTTTTNTTTTTTTTTGCGCATATTCACATATGTAAACCAGN 

ACATTCTATGTACNACAAACCTGGTTTTTAAAAAGGAAC 

>4507477H1 OVARTDTOL INCYTE 

GGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACGGCTTAGGGTAAT 
GTCAGTACAGGAAAA.A\ACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACTCTA\AGCTCCATGTCCTGGGCC 
TAAAATCGTATAAAATCTGGA 

>4163378H1 3RSTNOT32 INCYTE 

PuATAGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGNTCTGTATTTCAGTTCCTTTCGATA.CG 
GCTTAGGGTAATGTCAGTACAGGAAAAAAGCTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTA^.CTCTAJ^AGCTCC 
ATGTCCTGGGCCTAAAATCGTATA 
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>2054675H1 BEPINOT01 INCYTE 

AAAGGAACTATGTTGCTATGAATTAAACTTGTGTCGTGCTGATAGGACAGACTGGATTTTTCATA.TTTCTTATTAAAP.TT 

TCTGCCATTTAGAAGAAGAGAACTACATTCATGGTTTGGAAGAGATAAACCTGAAAAGAAGAGTGGCCTTATCTTCACTT 

TATCGATAAGTCAGTTTATTTGTTTCATTGTGTACATTTTTATATTCTCCTTTTGACATTATAACTGTTGGCTTTTCTAA 

TCTTGTTAAATATATCTATTTTTACCAAAGGTATTTAATATTCTTTTTTA 

>3 9 93180H1 LUNGNON03 INCYTE 

CACAAATCACTCACCGACGTGGCCCTGGAGCACCATGAGGNGTGTGACTGTGTGTGCAGAGGGAGCACAGGAGGATAGCC 
GCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCAT 
CCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCATCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGACATCAAACAG 
AATT AGGAGTTGTGC AAC AGC TC TTTTGAGAGGAGGCT AAAGG AC AGGAG AANAGGTC TT 
>3510192H1 CONCNOT01 INCYTE 

TGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGTTGTTTGCTTCAAGGACCTT 
TCATCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGACATCAAACAGAATT AGGAGTTGTGC AACAGCTCTTTTGAGAG 
GAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTGGAAAGAAAATTAAATGTTGTATTAAATAGATCACCAGCTAGTT 
TCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTT 
>4164633H1 BRSTNOT32 INCYTE 

CTTGTTAAATATATCTATTTTTACCAAAGGTATTTAATATTCTTTANTTATGACAACTTAGATCAACTATTTTTAGCTTG 

GTAAATTTTTCTAAACACAATTGTTATAGCCAGAGGAACAAAGATGATATAAAATATTGTTGCTCTGACAAAAATACATG 

TATTTCATTCTCGTATGGTGCTAGAGTTAGATTAATCTGCATTTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTGCA 

AAGACTTTTTGANAATAATTAAATTATCATATCTTCCATTCCTGTTATTGGGGGAGAAAAT 

>2559870H1 ADRETUT01 INCYTE 

CACGAGGTCCTTCAGTTGAGACCAAAGACCGGTGTCAGGGGATTGCACAAATCACTCACCGACGTGGCCCTGGAGCACCA 

TGAGGAGTGTGACTGTGTGTGCAGAGGGAGCACAGGGGGATAGCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGC 

AGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCA 

TCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGA 

>3817470H1 BONSTUT01 INCYTE 

TTAAAAAGGAACTATGTTGCTATGAATTAAACTTGTGTCATGCTGATAGGACAGACTGGATTTTTCATATTTCTTATTAA 

AATTTCTGCCATTTAGAAGAAGAGAACTACATTCATGGTTTGGAAGAGATAAACCTGAAAAGAAGAGTGGCCTTATCTTC 

ACTTTATCGATAAGTCAGTTTATTTGTTTCATTGTGTACATTTTTATATTCTCCTTTTGACATTATAACTGTTGGCTTTC 

TAATCTGTTAAATATATCTATTTTTACCAAAGGTATTTAATATTCTTT 

>3979767H1 LUNGTUT08 INCYTE 

GGAGGATAGCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGA.TTCTATTAGAGAACGTATGC 
GTTATCTCCATCCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCATCTTCAGGATTT AC AGTGCATTCTGAAAG AGGAG 
ACATCAAACAGAATTAGGAGTTGTGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTGTTCAATCGTG 
GAAAGAANATTAAATGTTGTATTAAATAGACACCAGCT 
>3980011H1 LUNGTUT08 INCYTE 

GGAGGATAGCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACG7ATGC 
GTTATCTCCATCCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCATCTTCAGGATTTACATGCATTCTGAAAGAGGAGA 
CATCAAACAGAATT AGGAGTTGTGC AAC AGCTCTTTTGAGAGGAGGCCTAAAGGAC AGGAG AAAAGGTCTTCAATCGTGG 
AAAGAAAATTAAATGTTGTATTAAATAGATCACCA 
>4825396H1 BLADDITO 1 INCYTE 

GAGAACCGATACCATTTTCTGGCCAGGTTGTCTCCTGGTTAAACGCTGTGGTGGGAACTGTGCCTGTTGTCTCCACAA7T 
GCAATGAATGTCAATGTGTCCCAAGCAAAGTTACTAAAAAA.TACCACGAGGTCCTTCAGTTGAGA.CCAJAAGACCGGTGTC 
AGGGGATTGCACAAATCACTCACCGACGTGGCCCTGGAGCACCATGAGGAGTGTGACTGTGTGTGCAGAGGGAGCACAGG 
AGGATAGCCGCATCACCACCA 

>3073703H1 BONEUNT01 INCYTE 

AGAAAATCCAGAGTGGTGGATCTGAACCTTCTAACAGAGGAGGTAAGATTATACAGCTGCACACCTCGTAACTTCTCAGT 

GTCCATAAGGGAAGAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTCTCCTGGTTAAACGCTGTGGTCCGAAC7 

GTGCCTGTTGTCTCCACAATTGCAATGAATGTCAATGTGTCCCAAGCAAAGTTACTAAAAAATACCACGAGGTCCTTCAG 

TTGAGACCAAAGACCGGTGTCAGGGGATTGCACAAATCA 

>862169H1 BRAITUT03 INCYTE 

AGATGATATAAAATATTGTTGCTCTGACAAAAATACATGTATTTCATTCTC^ 

TTTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTGCAAAGACTTTTTGAAAATAATTAAATTATCATATCTTCCATTC 
CTGTTATTGGAGATGAAAATAAAAAGCAACTTATGAAAGTAGACATTCAGATCCAGCCATTACTAACCTATTCCTTTTTT 
GCUCAAATCTGAGCCTAGC 

>4201385H1 BRAITUT2 9 INCYTE 

TTTTTAAAAAGGAACTATGTTGCTATGAATTAAACTTGTGTCGTGCTGATAGGACAGACTGGATTTTTCATATTTCTTAT 
TAA.\ATTTCTGCCATTTAGAAGAAGAGAACTACATTCATGGTTTCGAAGAGATAAACCTGAAAAGAAGAGTGGCCTATCT 
TCACTTTATCGATAAGTCAGTTTATTTGTTTCATTGTCTACATTTTTATATTCTCCTTTGACATATAACTGTTGGCTTTT 
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CTAATCTGTTAAATATATCTATTTTTACCAAAGGTATTTAATAT 
>1302516H1 PLACNOT02 INCYTE 

AGGAAATCAAATTAGGATAAGATTTGTATCTGATGAATATTTTCCTTCTGA-rtCCTTCTAACAGAGGAGGTAAGATTATAC 
AGCTGCACACCTCGTAACTTCTCAGTGTCCATAAGGGAAGAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTC7 
CCTGGTTAAACGCTGTGGTGGGAACTGTGCCTGTTGTCTCCCACAATTGCAATGAATGTCAATGTGTCCCAAGCAAAGTT 
ACTAAAAAATACCACGAGGTCC 

>3684109H1 HEAANOTOl INCYTE 

ATTTCATCTTCAGGATTTACAGTGCATTCTGAAANAGGAGAAATCAAACANAATTAGGAGTTGTGCAACAGCTCTTTTGA 

GAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTGGAAANAAAATTAAATGTTGTATTAAATAGATCACCAGC7A 

GTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACGGCTTAGGGTAATGTCAG 

TACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTT 

>2549720H1 LUNGTUT06 INCYTE 

TTAGCTTGGNAAATTTTTCTAAACACAATTGTTATAGCCAGAGGAACAAAGATGATATAAAATATTGTTGCTCTGACAAA 
AATACATGTATTTCATTCTCGTATGGTGCTAGAGTTAGATTAATCTGCATTTTAAJVAAACTGAATTGGAATAGAATTGGT 
AAGTTGCAAAGACTTTTTGAAAATAATTAAATTATCATATCTTCCATTCCTGTTATTGGAGATGAAAATAAAAAGCAACT 
TATGANAGTAG 

>877279H1 LUNGAST01 INCYTE 

CTTTTTTATGACAACTTAGATCAACTATTTTTAGCTTGGTAAATTTTTCTAAACACAATTGTTATAGCCAGAGGAACAAA 
GATGATATAAAATATTGTTGCTCTGACAAAAATACATGTATTTCATTCTCGTATGGTGCTAGAGTTAGATTAATCTGCAT 
TTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTGCAAAGGCTTTTTGAAAATAATTAAATTATCATATCTTCCATTCC 
TGTTATTGGNGG 

>4713188K1 BRAIHCTOl INCYTE 

CAAAGTTACTAAAAAATACCACGAGGTCCTTCAGTTGAGACCAAAGACCGGTGTCAGGGGATTGCACAAATCACTCACCG 
ACGTGGCCCTGGAGCACCATGAGGAGTGTGACTGTGTGTGCAGAGGGAGCACAGGAGGATAGCCGCATCACCACCAGCAG 
CTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAOTTCT 
TTGCT 

>2171082H1 ENDCNOT03 INCYTE 

AGATAAACCTGAAAAGAAGAGTGGCCTTATCTTCACTTTATCGATAAGTCAGTTTATTTGTTTCATTGTGTA.CAT7TTTA 
TATTCTCCTTTTGACATTATAACTGTTGGCTTTTCTAATCTTGTTAAATATATCTATTTTTACCAAAGGTATTTAATA77 
CTTTTTTATGACAACTTAGATCAACTATTTTTAGCTTGGTAAATTTTTCTAAACACAATTGTTATAGCCAGAGGAACAArt 
GATGA 

>875860H1 LUNGASTOl INCYTE 

CTGGATTTTTCATATTTCTTATTAAAATTTCTGCCATTTAGAAGAAGAGAACTACATTCATGGTTTGGAAGAGATAAACC 
TGAAAAGAAGAGTGGCCTTATCTTCACTTTATCGATAAGTCAGTTTATTTGTTTCATTGTGTACATTTTTATATTCTCC7 
TTTGACAT7ATAACTGTTGGCTTTTCTAATCTTGTTAAATATATCTATTTTTACCAAAGGTATTTAATATTCTTTTTTAT 
GAC 

>706168H1 SYNORAT04 INCYTE 

GCTCATATTCACATATGTAAACCAGAACATTCTATGTACTACAAACCTGGTTTTTAAAAAGGANCTATGTTGCTATGAA7 
TAAACTTGTGTCGTGCTGATAGGACAGACTGGATTTTTCATATTTCTTATTAAAATTTCTGC'CATTTAGAAGAAGAGAAC 
7ACATTCATGGTTTGGAAGAGATAAACCTGAAAAGAAGAGTGGCCTTATCTTCANTTTATCGATAAGTCAGTTTATT7G7 
TTCA 

>458S23H1 KERANOTO 1 INCYTE 

ANGAGTTGCCCAGAGC7GTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGT7ATC7CCATCCTTAATCTCAG77 
GTTTGNTTCAAGGACCTTTCATCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGACATCAAACAGAATTAGGAGT7GTG 
CAACAGCTCTTTTGAGAGGAGGCCTAAAGGNCAGGAGAAAAGGTCTTCAATCGTGGAAAGAAAATTAAATGTTGTATTAJi 
ATAGATC 

>538436K1 LNODNOT02 INCYTE 

AAAGATGATATAAAATATTGTTGCTCTGACAAAAATACATGTATTTCATTCTCGTATGGTGCTAGAGTTAGATTAATCTG 
CATTTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTGCAAAGACTTTTTGAAAATAATTAAATTATCA7ATC7TCCA7 
TCCTGTTATTGGAGATGAAAATAAAAAGCAACTTATGAAAGTAGACATTCAGATCCAGCCATTACT.A.ACCTAT 
>130390 9H1 ? LAC NOT 02 INCYTE 

AGGAAATCAAAT7AGGATAAGATTTGTATCTGATGAATATTTTCCT7CTGAACC7TCTAACAGAGGAGGTAAGATTA7AC 
AGCTGCACACCTCGTAACTTCTCAGTGTCCATAAGGGAAGAACTAAAGAGAACCGATACCAT'l'TTCTGGCCAGGTTG rCT 
CCTGGTTAAACGC7GTGGTGGGAACTGTGCCTGTTGTCTCCACAATTGCAATGAATGTCAATGTGTCCCAAG 
>2739211H1 OVARNOT09 INCYTE 

GTGCATTCTGAAAGAGGAGACATCAAACAGAATTAGGAGTTGTGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGA 
GAAAAGGTCTTCAATCGTGGAAAGAAAATTAAATGTTGTATTAAATAGATCACCAGCTAGTTTCAGAGTTACCATGTACG 
TATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACGGCTTAGGGTAATGTCAGTACAGGAAAAAAACTGTGCAA 
GTGACCACCTGAT 
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>2550343H1 LUNGTUT06 INCYTE 

TGTACATTTTTATATTCTCCTTTTGACATTATAACTGTTGGCTTTTC^^TCTTGTTAAATATATCTATTTTTACCAAAG 
GTATTTAATATTCTTTTTTATGACAACTTAGATCAACTATTTTTAGCTTGGTAAATTTTTCTAAACACAATTGTTATAGC 
CAGAGGAACAAAGATGATATAAAATATTGTTGCTCTGACAAAAATACATGTATTTCATTCTCGTATGGTGCTA 
>5321148H1 FIBPFEN06 INCYTE 

CACAATTGTTATAGCCAGAGGAACAAAGATGATATAAAATATTGTTGCTCTGNCAAAAATACATGTATTTCATTCTCGTA 
TGGTGCTAGAGTTAGATTAATCTGCATTTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTGCAAAGACTTTTTGAAAA 
TAATTAAATTATCATATCTTCCATTCCTGTTATTGGAGATGAAAATAAAAAGCAACTTATGAAAGTAAATTCAGATCCAC 
CATTACTAAC 

>879495H1 THYRNOT02 INCYTE 

ATTTCATTCTCGTATGGTGCTAGAGTTAGATTAATCTGCATTTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTGCAA 
AGACTTTTTGAAAATAATTAAATTATCATATCTTCCATTCCTGTTATTGGAGATGAAAATAAAAAGCAACTTATGAAAGT 
AGACATTCAGATCCAGCCATTACTAACCTATTCCTTTTTTGGGGAAATCTGAGCCTAGCTCAGAAAAACATAAAGCACCT 
TGAAAAA 

>3325591H1 PTHYNOT03 INCYTE 

TGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTGGAAAGAAAATTAAATGTTGTATT 

AAATAGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACG 

GCTTAGGGTAATGTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACCCTAAAGCNCC 

ATGTCNNGGGCNAAAANCGAAAAAT 

>543890H1 OVARNOT02 INCYTE 

TTTCTAAACACAATTGTTATAGCCAGAGGAACAAAGATGATATAAAATATTGTTGCTCTGACAAAAATACATGTATTTCA 
TTCTCGTATGGTGCTAGAGTTAGATTAATCTGCATTTTAAAAAACTGAATTGGNATAGAATTGGTAAGTTGCAAAGNCTT 
TTTGAAAATAATTAAATTATCATATCTTCCATTCCTGTTATTGGAGGATGGAAAATAAAAAGCAACTTATGGAAAGTAGG 
ACATTCAGATC 

>3733565H1 SMCCNOS01 INCYTE 

CCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCATCTTCAGGATTTACAGTGCATTCTGNAAGANGAGACATCAAACAG 

AATTAGGNGTTGTGCAAAAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTCTNCAATCGTGGAAAGNAAATT 

AAATGTTGTATNAAATNGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGNCNGTATTCAGTCT 

TTCGGAACGGCTTAGGGTAATGTCAGTACAGGANAAAAACTGTGCAGTGAG 

>4641939H1 PROSTMT03 INCYTE 

GTACTACAAACCTGGTTTTTAAAAAGGAACTATGTTGCTATGAATTAAACTTGTGTCCATGCTGATAGGACAGACTGGAT 
TTTNCATATTTCTTATTAAAATTTCTGCCATTTAGAAGAAGAGA-ACTACA.TTCATGGTTTGGNAGAGATAAACCTGAAAA 
GAAGAGTGGCCTTATCTTCACTTTATCGATAAGTCAGTTTATTTGTTTCATGTGTACATTTTTATATTCTCCTTTGACAT 
ATAACGTGGCTTT 

>2007780H1 TESTNOT03 INCYTE 

TTATATTCTCCTTTTGAC ATTATAACTGTTGGCTTTTCTAATCTTGTTAAATATATCTATTTTTACCAAAGGTATTTAAT 
ATTCTTTTTTATGACAACTTAGATCAACTATTTTTAGCTTGGTAAATTTTTCTAAACACAATTGTTATAGCCAGAGGAAC 
AAAGATGATATAAAATATTGTTGCTCTGANAAAAATACATGTAT 
>3085331H1 KEAONOT03 INCYTE 

GCTCATATTCACATATGTAAACCAGAACATTCTATGTACTACAAACCTGGTTTTTAAAAAGGAACTATTTGCTATGAATT 
AAACTTGTGTCGTGCTGATAGGACAGACTGGNTTTTTCATATTTCTTATTANAATTTCTGCCATTAGAAGAAGAGAACTA 
CATTCATGGTTTGGAAGAGATAAACCTGAAAAGAAGAGTGGCCTATTTCACTTTATCGATAAGTCAGT 
>3414043H1 PTHYNOT04 INCYTE 

G C TC A i ATTC AC AT ATG T AAACC AG AAC A iT C T ATG T AC T AC AAACCTG G TTTTT AAAAAG C AAC T ATG TTG C T ATG AAT 
TAJ^ACTTGTGTCGTGCTGATAGGACAGACTGGATTTTTCATATTTCTTATTAAAATTTCTGCCATTTAGPiAGAAGAGAAC 
TACATTCATGGTTTGGAAGAGATAAACCTGAAA 
>3705963H1 PENCNOT07 INCYTE 

AN\ACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACTCTAAAGCTCCATGTCCTGGGCCTAAA-ATCGTATAAAA 
TCTGGAnnnnnnnnnnnnnnnnnnGCTCATATTCACATATC^ 

AJ^AAGGAACTATGTTGCTATGAATTAAACTTGTGTCGTGCTGATAGGACAGACTGGATTTTTCATATTTCTTATTAAAAT 
TTC7GCCATTAGAAGAAGAGAACTACNTTCANGGTTTGGAAGAGATAACCCTGAAAAGANGGG 
>5I37051H1 OVARDIT04 INCYTE 

AAAAAACTGAATTGGAATAGAATTGGTAAGTTGCAAAGACTN^ 

TATTCGAGATGAANATAAAAAGCAACTTATGAAAGTAGACAT""CAGATCCAGCC ATTACH' AACCTATTCCTTTTTTGGGC 
AAATCTGAGCCTAGCTCAGAAAAACATAAAGCACCTTCAAAAAGACTTCGCACCTTCCTGATAAAGCGTGCTGTNTGTCA 
GTAGGAACACATCCTATTTATTGTGATGNTGTGGTTTATTAT 
>3554223H1 SYNONOT01 INCYTE 

ATTAAATAGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGAT 
ACGGCTTAGGGTAATGTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGGCTTAACTCTAAAG 
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CTCCATGTCCTGGGCCTAAAATCGTATAAAATCTGGATTTTTTTNTTTTTTTTTGCGCATATTCACATATGTAAACCAGM 

ACATTCTATGTACNACAAACCTGGTTTTTAAAAAGGAAC 

>4507477H1 OVARTDTO 1 INCYTE 

GGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACGGCTTAGGGTAAT 
GTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACTCTAAAGCTCCATGTCCTGGGCC 
TAAAATCGTATAAAATCTGGA 

>1955646H1 CONNNOT01 INCYTE 

TGGTAAGTTGCAAAGACTTTTTGAAAATAATTAAATTATCATATCTTCCATTCCTGTTATTGGAGATGAAAATAAAAAGC 
AACTTATGAAAGTAGACATTCAGATCCAGCCATTACTAACCTATTCCTTTTTTGGGGAAATCTGAGCCTAGCTCAGAAAA 
ACATAAAGCACCTTGAAAAAGACTTGGCAGCTTCCTGATAAAGCGTGCTGTGCTGTGCAGTAGGGAACACATCCTATTTA 

TTGTGATGTTGTGGTTTATATCCTAAACC 
>4163378H1 BRSTNOT32 INCYTE 

AATAGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGNTCTGTATTTCAGTTCCTTTCGATACG 
GCTTAGGGTAATGTCAGTACAGGAAAAAAGCTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACTCTAAAGCTCC 

ATGTCCTGGGCCTAAAATCGTATA 

>5095141H1 EPIMNON05 INCYTE 

AGATAAACCTGAAAAGAAGAGTGGCCTTATNTTCACTTTATCGATAAGTCAGNTTATTTGTTTCATTGTGTACATTTOTIA 
TATTCTCCTTTTGACATTATAACTGNTGGCTTTTCTAANCNTGTTAAATATATCTATTTTTACCAAAGGTATTTAATATT 

CTTT 

>94 3 82 6H1 ADRENOT03 INCYTE 

TATGGTGCTAGAGTTAGATTAATCTGCATTTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTGCAAAGACTTTTTGAA 

AATAATTAAATTATCATATCTTCCATTCCTGTTATTGGAGATGAAAATAAAAAGCAACTTATG 

>3451273H1 UTRSNON03 INCYTE 

TTTTTTNTTTTGCTCATATTCACATATGTAAACCNGAACATTCTATGTACNACAAACCTGGTTTTTAAAAAGGAACTATG 
TTGCTATGAATTAAACTTGTGTCGTGCTGATAGGACAGACTGGATTTTTCANATTTCTTANTAANNTTTCTGCCATTTAG 

AAGA 

>1402278H1 LATRTUT02 INCYTE 

GTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACTCTAAAGCTCCATGTCCTGGGCCTAAA 
ATCGTATAAAATCTGGAnnnnnnnnnnnnnnnnnnGCTCATATTCACATATGTAAACCAGAACATTCTATGTACTACAAA 
CCTGGTTTTTAAAAAGGAACTATGTTGCTATGAATTAAACTTGTGTCGTGCTGATAGGACAGACTGGATTTTTCATATTT 

CTTA 

>4361191H1 SKIRNOT01 INCYTE 

GCAAAGACTTTTTGANAATNATTAANTTATCATATCTTCCATTCCTGTTATNGGAGATGANA^TAAAAAGCAACTTATGA 
AAGTAGACATTCAGATCCAGCCATTACTAACCTATTCCTTTTT7GGGGAAATCTGAGCCTAGCNCAGAAAAACATAAAGC 
ACCTTGAAAAAGACTTGGCAGCTTCCTGATAAAGCGTGCTGTGCTGTGCAGTAGGAACACATCCNATTTATTGTGNTGTM 

GMGGTTTTATGATC 

>1307017H1 PL AC NOT 0 2 INCYTE 

TGTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACTCTAAAGCTCCATGTCCTGGGC 

CTAAAATCGTATAAAATCTGGAnnnnnnnnnnnnnnnnnnGCTCATATTCACATATGTAAACCAGA^CATTCTATGTAC 

ACAAACCTGGTTTTTAAAAAGGAACTATGTTGCTATGAATTAAACTTGTGTCATGCTGATAGGACAGACTGGATTTTTCA 

TAT 

>5032225H1 HEARFET03 INCYTE 

AATTATCATATCTTCCATTCCTGTTATTGGAGATGNAAATAAAAAGCAACTTATGAAAGTAGACATTCAGATCCAGCCAT 
TACTAACCTATTCCTTTTTTGGGGAAATCTGAGCCTAGCTCAGAA-AAACATAAAGCACCTTGAAAAAGACTGTCAGCTTC 
CTGATAAAGCGTGCTGTGCTGTGCAGTAGGAACACATCCTATTTATTGTGATGTTGTGGTTTTATTATCTTAAACTCGTT 

CCAT 

>3732621K1 SMCCNOS01 INCYTE 

ANAGATGATATAAAANATTGTTGCTCTGACAANNATACATGTATTTCATTCTCGTATGGTGCTAGAGTTAGATTAATCTG 
CNTTTTAAAAAACTGANTTGGAATAGANTTGGTAAGTTGCAAAGNCNTTTGAAAATNATTAAGTTATCAGAT 
>3530274H1 BLADNOT09 INCYTE 

TTCCATTCCTGTTATTGGAGATGAAAATAAAAAGCAACTTATGAAAGTAGACATTCAGATCCAGCCATTACTAACCTATT 

CCTTTTTTGGGGAAATCTGAGCCTAGCTCAGAAAAACATAAAGCACCTTGAAAAAGACTTGGCAGCTTCCTGATAakAGCG 

TGCTGTGCTGTGCAGTAGGAACACATCCTATTTATTGTGATGTTGTGGTTTTATTATCTAAACTCTGTTCCATACACTTG 

TATAAATACATGGATATTTTTATGTACAGAAGTATGTCTCTTAACCAGTTCA 

>3530249H1 BLADNOT09 INCYTE 

CTTCCATTCCTGTTATTGGAGATGAAAATAAAAAGCAACTTATGANAGTAGACATTCAGATCCAGCCATTACTAACCTAT 
TCCTTTTTTGGGGAAATCTGAGCCTAGCTCAGAAAAACATAAAGCACCTTGAAAAAGACTTGGCAGCTTCCTGATAAAGC 
GTGCTGTGCTGTGCAGTAGGAACACATCCTATTTATTGTGATGTTGTGGTTTTATTATCTTAAACTCTGTTCCATACACT 
TGTATAAATACATGGATATTTTTATGTACAGAAGTATGTCTCTTAACCAGTTCACTTATTGTACCTGG 



VEGFE1 

VEGFE2 

VEGFE3 

VEGFE4 

VEGFE5 

VEGFE6 

VEGFE7 

VEGFE8 

VEGFE9 

VEGFE10 



AAAATGTATGG ATAC AA CTTA C 
GTTTGATGAAAGATTTGGGCTTG 
TTTCT AAA GG AAATC AAATTA G 

GATAAGATTTGTATCTGATG 

GATGTCTCCTCTTTCAG 

GCACAACTCCTAATTCTG 

AGCACCTGATTCCGTTGC 

TAGTACATAGAATGTTCTGG 

AAGAGACATACTTCTGTAC 

CCAGGTACAATAAGTGAACTG 
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+ 3 M N I F L L 

NLLT EEV RLY 

] 



1 AGGAAATCAA ATTAGGATAA GATTTGTATC TGATGAATAT TTTCCTTCTG 
AACCTTCTAA CAGAGGAGGT AAGATTATAC 

TC CTTTAGTT TAATCCTATT CTAAACATAG ACTACTTATA AAAGGAAGAC 
TTGGAAGATT GTCTCCTCCA TTCTAATATG 

+ 3 S C T P RNF SVS IREE LKR 
TDT IFWP GCL 



81 AGCTGCACAC CTCGTAACTT CTCAGTGTCC ATAAGGGAAG AACTAAAGAG 
AACCGATACC ATTTTCTGGC CAGGTTGTCT 

TCGACGTGTG GAGCATTGAA GAGTCACAGG TATTCCCTTC TTGATTTCTC 
TTGGCTATGG TAAAAGACCG GTCCAACAGA 

-2 < 



+ 3 LVK RCGG NCA CCL HNCN 
ECQ CVP SKV 



161 CCTGGTTAAA CGCTGTGGTG GGAACTGTGC CTGTTGTCTC CACAATTGCA 
ATGAATGTCA ATGTGTCCCA AGCAAAGTTA 

GG AC CAATTT GCGACACCAC CCTTGACACG GACAACAGAG GTGTTAACGT 
TACTTACAGT TACACAGGGT TCGTTTCAAT 

-2 



+ 3TKKY HEV LQLR PKT GVR 
GLKK SLT D V A 



+ 1 V 
D C T N H S P T W P 



241 CTAAAAAATA CCACGAGGTC CTTCAGTTGA GACCAAAGAC CGGTGTCAGG 
GGATTGCACA AATCACTCAC CGACGTGGCC 

GATTTTTTAT GGTGCTCCAG GAAGTCAACT CTGGTTTCTG GGCACAGTCC 
CCTAACGTGT TTAGTGAGTG GCTGCACCGG 

-2 

[ 



+ 3LEHH EEC DCV CRGS TGG 



> 

+ 2 VQREHRR 
IAAS P P A ALA 



+ 1 WST MRSV TVC AEG AQED 
SRI TTS SSC 



321 CTGGAGCACC ATGAGGAGTG TGACTGTGTG TGCAGAGGGA GCACAGGAGG 
ATAGCCGCAT CACCACCAGC AGCTCTTGCC 

GACCTCGTGG TACTCCTCAC ACTGACACAC ACGTCTCCCT CGTGTCCTCC 
TATCGGCGTA GTGGTGGTCG TCGAGAACGG 

+ 2QSCA VQW LIL LENV CVI 
SIL NLSC LLQ 



+ 1PELC SAV ADSI RER MRY 

LHP 
> 

401 CAGAGCTGTG CAGTGCAGTG GCTGATTCTA TTAGAGAACG TATGCGTTAT 
CTCCATCCTT AATCTC AGTT GTTTGCTTCA 

GTCTCGACAC GTCACGTCAC CGACTAAGAT AATCTCTTGC ATACGCAATA 
GAGGTAGGAA TTAGAGTCAA CAAACGAAGT 

+ 2 G P F IFRI YSA F 
> 

481 AGGACCTTTC ATCTTCAGGA TTTACAGTGC ATTCTGAAAG AGGAGACATC 
AAACAGAATT AGGAGTTGTG CAACAGCTCT 

TCCTGGAAAG TAGAAGTCCT AAATGTCACG TAAGACTTTC TCCTCTGTAG 
TTTGTCTTAA TCCTCAACAC GTTGTCGAGA 

561 TTTGAGAGGA GGCCTAAAGG AC AG G AG AAA AGGTCTTCAA TCGTGGAAAG 
AAAATTAAAT GTTGTATTAA ATAGATCACC 

AAACTCTCCT CCGGATTTCC TGTCCTCTTT TCCAGAAGTT AGCACCTTTC 
TTTTAATTTA CAACATAATT TATCTAGTGG 

641 AGCTAGTTTC AG AGTTAC C A TGTACGTATT CCACTAGCTG GGTTCTGTAT 
TTCAGTTCTT TCGATACGGC TTAGGGTAAT 

TCGATCAAAG TCTCAATGGT ACATGCATAA GGTGATCGAC C C AA G AC AT A 
AAGTCAAGAA AGCTATGCCG AATCCCATTA 

721 GTCAGTACAG GAAAAAAACT GTGCAAGTGA GCACCTGATT CCGTTGCCTT 
GGCTTAACTC TAAAGCTCCA TGTCCTGGGC 

CAGTCATGTC CTTTTTTTGA CACGTTCACT CGTGGACTAA GGCAACGGAA 
CCGAATTGAG ATTTCGAGGT ACAGGACCCG 

801 CTAAAATCGT ATAAAATCTG GA 
GATTTTAGCA TATTTTAGAC CT 
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+ 3 M N I F L L 

NLLT EEV RLY 

j 



1 AGGAAATCAA ATTAGGATAA GATTTGTATC TGATGAATAT TTTCCTTCTG 
AACCTTCTAA CAGAGGAGGT AAGATTATAC 

TCCTTTAGTT TAATCCTATT CTAAACATAG ACTACTTATA AAAGGAAGAC 
TTGGAAGATT GTCTCCTCCA TTCTAATATG 

+ 3SCTP RNF SVS IREE LKR 
TDT IFWP GCL 

j 



81 AGCTGCACAC CTCGTAACTT CTCAGTGTCC ATAAGGGAAG AACTAAAGAG 
AACCGATACC ATTTTCTGGC CAGGTTGTCT 

TCGACGTGTG GAGCATTGAA GAGTCACAGG TATTCCCTTC TTGATTTCTC 
TTGGCTATGG TAAAAGACCG GTCCAACAGA 

-2 < 



+ 3 LVK RCGG N C A CCL H N C N 
ECQCVPSKV 



161 CCTGGTTAAA CGCTGTGGTG GG AACTGTGC CTGTTGTCTC CACAATTGCA 
ATGAATGTCA ATGTGTCCCA AGCAAAGTTA 

GGACCAATTT GCGACACCAC CCTTGACACG GACAACAGAG GTGTTAACGT 
TACTTACAGT TACACAGGGT TCGTTTCAAT 

-2 



+ 3TKKY HEV L Q L R PKT GVR 
GLHK SLT D V A 



+ 1 V S G 

DCT MHSP TWP 

1 



2 41 CTAAAAAATA CCACGAGGTC CTTCAGTTGA GACCAAAGAC CGGTGTCAGG 
GGATTGCACA AATCACTCAC CGACGTGGCC 

GATTTTTTAT GGTGCTCCAG GAAGTCAACT CTGGTTTCTG GCCACAGTCC 
CCTAACGTGT TTAGTGAGTG GCTGCACCGG 
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-2 



+ 3LEHH EEC D CV CRGS TGG 

> 

+ 2 VQREHRR 
IAAS PPA ALA 

] 



+ 1 WST MRSV TVC AEG AQED 
SRI TTS SSC 



321 CTGGAGCACC ATGAGGAGTG TGACTGTGTG TGCAGAGGGA GCACAGGAGG 
ATAGCCGCAT CACCACCAGC AGCTCTTGCC 

GACCTCGTGG TACTCCTCAC ACTGACACAC ACGTCTCCCT CGTGTCCTCC 
TATCGGCGTA GTGGTGGTCG TCGAGAACGG 

+ 2QSCA VQW LIL LENV CVI 
SIL NLSC LLQ 



+ 1PE .LC SAV ADSI RER MRY 

LHP 
> 

4 01 CAGAGCTGTG CAGTGCAGTG GCTGATTCTA TTAGAGAAC G TATGCGTTAT 
CTCCATCCTT AATCTCAGTT GTTTGCTTCA 

GTCTCGACAC GTCACGTCAC CGAC7AAGAT AATCTCTTGC ATACGCAATA 
GAGGTAGGAA TTAGAGTCAA CAAACGAAGT 

+ 2 GPF IFRI Y 5 A F 
> 

4 81 AGGACCTTTC ATCTTCAGGA TTTACAGTGC ATTCTGAAAG AGGAGACATC 
AAACAGAATT AGGAGTTGTG CAACAGCTCT 

TCCTGGAAAG TAGAAGTCCT AAATGTCACG TAAGACTTTC TCCTCTGTAG 
TTTGTCTTAA TCCTCAACAC GTTGTCGAGA 

561 TTTGAGAGGA GGCCTAAAGG AC AGG AGAAA AGGTCTTCAA TCGTGGAAAG 
AAAATTAAAT GTTGTATTAA ATAGATCACC 

AAACTCTCCT CCGGATTTCC TGTCCTCTTT TCCAGAAGTT AGCACCTTTC 
TTTTAATTTA CAACATAATT TAT.CTAGTGG 

641 AGCTAGTTTC AGAGTTACCA TGTACGTATT CCACTAGCTG GGTTCTGTAT 
TTCAGTTCTT TCGATACGGC TTAGGGTAAT 

TCGATCAAAG TCTCAATGGT ACATGCATAA GGTGATCGAC CCAACACATA 
AAGTCAAGAA AGCTATGCCG AATCCCATTA 

721 GTCAGTACAG G AA AAAAAC T CTGCAAGTGA GCACCTGATT CCGTTGCCTT 
GGCTTAACTC TAAAGCTCCA TGTCCTGGGC 
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CAGTCATGTC CTTTTTTTGA CACGTTCACT CGTGGACTAA GGCAACGGAA 
CCGAATTGAG ATTTCGAGGT ACAGGACCCG 

8 01 CTAAAATCGT ATAAAATCTG GATTTTTTTN TTTTTTTTTG CGCATATTCA 
CATATGTAAA CCAGAACATT CTATGTACTA 

GATTTTAGCA TATTTTAGAC CTAAAAAAAN AAAAAAAAAC GCGTATAAGT 
GTATACATTT GGTCTTGTAA GATACATGAT 

881 CAAACCTGGT TTTTAAAAAG GAACTATGTT GCTATGAATT AAAC TTGTGT 
CGTGCTGATA GGACAGACTG GATTTTTCAT 

GTTTGGACCA AAAATTTTTC CTTGATACAA CGATACTTAA TTTGAACACA 
GCACGACTAT CCTGTCTGAC CTAAAAAGTA 

-3 < 



9 61 ATTTCTTATT AAAATTTCTG CC ATTTAGAA GAAGAGAACT ACATTCATGG 
TTTGGAAGAG ATAAACCTGA AAAGAAGAGT 

TAAAGAATAA TTTTAAAGAC GGTAAATCTT CTTCTCTTGA TGTAAGTACC 
AAACCTTCTC TATTTGGACT TTTCTTCTCA 

-3 



1041 GGCCTTATCT TCACTTTATC GATAAGTCAG TTTATTTGTT TCATTGTGTA 
CATTTTTATA TTCTCCTTTT GACATTATAA 

CCGGAATAGA AGTGAAATAG CTATTCAGTC AAATAAACAA AGTAAC AC AT 
GTAAAAATAT AAGAGGAAAA CTGTAATATT 
-3 [ 

1121 CTGTTGGCTT TTCTAATCTT GTTAAATATA TCTATTTTTA CCAAAGGTAT 
TTAATATTCT TTTTTATGAC AACTTAGATC 

GACAACCGAA AAGATTAGAA CAATTTATAT AGATAAAAAT GGTTTC C ATA 
AATTATAAGA AAAAATACTG TTGAATCTAG 

12 01 AACTATTTTT AGCTTGGTAA ATTTTTCTAA ACACAATTGT TATAGCCAGA 
GGAACAAAGA TGATATAAAA TATTGTTGCT 

TTGATAAAAA TCGAACCATT TAAAAAGATT TGTGTTAACA ATATCGGTCT 
CCTTGTTTCT ACTATATTTT ATAACAACGA 

12 81 CTGACAAAAA TACATGTATT TCATTCTCGT ATGGTGCTAG AGTTAGATTA 
ATCTGCATTT TAAAAAACTG AATTGGAATA 

GACTGTTTTT ATGTACATAA AGTAAGAGCA TACCACGATC TCAATCTAAT 
TAGACGTAAA ATTTTTTGAC TTAACCTTAT 

13 61 GAATTGGTAA GTTGCAAAGA CTTTTTGAAA ATAATTAAAT TATCATATCT 
TCCATTCCTG TTATTGGAGA TGAAAATAAA 

CTTAACCATT CAACGTTTCT GAAAAACTTT TATTAATTTA AT AG TAT AG A 
AGGTAAGGAC AATAACCTCT ACTTTTATTT 

14 41 AAGCAACTTA TGAAAGTAGA CATTCAGATC CAGCCATTAC TAACCTATTC 
CTTTTTTGGG GAAATCTGAG CCTAGCTCAG 

TTCGTTGAAT ACTTTCATCT GTAAGTCTAG GTCGGTAATG ATTGGATAAG 
G AAA AAAC C C CTTTAGACTC GGATCGAGTC 
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1521 AAAAACATAA AGCACCTTGA AAAAGACTTG GCAGCTTCCT GATAAAGCGT 
GCTGTGCTGT GCAGTAGGAA CACATCCTAT 

TTTTTGTATT TCGTGGAACT TTTTCTGAAC CGTCGAAGGA CTATTTCGCA 
CGACACGACA CGTCATCCTT GTGTAGGATA 

1601 TTATTGTGAT GTTGTGGTTT TATTATCTTA AACTCTGTTC CATACACTTG 
TATAAATACA TGGATATTTT TATGTACAGA 

AATAAC AC TA CAACACCAAA ATAATAGAAT TTGAGACAAG GTATGTGAAC 
ATATTTATGT ACCTATAAAA ATACATGTCT 



1681 AGTATGTCTC TTAACCAGTT CACTTATTGT ACCTGG 
TC AT AC AG AG AATTGGTCAA GTG AATAAC A TGGACC 
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S^^s** f'^* DNA and polypeptide sequence used for mammalian cell expression 

+ 1 mslfgXl lltsalagqr 

1 GGATCCAAAA TGAGCCTCTT CGGGCTTCTC CTGCTGACAT CTGCCCTGGC CGGCCAGAGA 

+ 1 q g t q aES NLS SKFQ F S S NKE 
61 CAGGGGACTC AGGCGGAATC CAACCTGAGT AGTAAATTCC AGTTTTCCAG CAACAAGGAA 

♦ 1QNGV QDP QHE R I I T VST NGS 
121 CAGAACGGAG TACAAGATCC TCAGCATGAG AGAATTATTA CTG7G7CTAC TAATGGAAGT 

+ 1IKSP R F P HTY PR NT VLV WRL 
181 ATTCACAGCC CAAGGTTTCC TCATACTTAT CCAAGAAA7A CGGTCTTGGT ATGGAGATTA 

+ 1VAVE E N V WIQ LTFD ERF GLE 
241 GTAGCAGTAG AGaAAAATGT ATGGATACAA CTTACGTT7G ATGAAAGATT TGGGCTTGAA 

+ 1DPED DIC K Y D FVEV E E P SDG 
301 GACCCAGAAG ATGACATATG CAAGTATGAT TTTGTAGAAG TTGAGGAACC CAGTGATGGA 

+ 1TILG R W C GSG T V ? G K Q I SKG 
361 ACTATATTAG GC-CGCTGGTG TGGTTCTGGT ACTGTACCAG GAAA,-.CAGAT TTCTAAAGGA 

+ 1NQIR IRF VSD EYFP SEP GFC 
4 21 AATCAAATTA GGA7AAGATT TGTATCTGA7 GAATA7777C CTTCTGAACC AGGGTTCTGC 

+ 1 I H Y N I V M PQF TEAV S'PS VLP 
4 81 ATCCACTACA ACA77G7CAT GCCACAATTC ACAGAAGC7G TGAG7CCTTC AGTGCTACCC 

+-1 P S A L ? L D LLN' N A I T A F 5 TLE 
541 CCTTCAGCTT TGCCACTGGA CCTGCTTAA7 AA7GC7A7AA CTGCC77TAG TACCTTGGAA 

+ 1DLIR Y L E PER W Q L D I_ E 7 L Y ?. 
6 01 GACCT7A7TC GA7A7CTTGA ACCAGAGAGA 7GGCA377GG AC77AGAAGA TCTATA7AGG 

+ 1 P T W Q I L G K A F V F G R X S R V V D 
661 CCAACTTGGC AACT7CTTGG CAAGGCTT77 GTTTTTGGAA GAA-AA7CCAG AG7GGTGGAT 

+ 1LNLL 7EE VRL YSCT = R N FSV 
721 CTGAACCTTC TAA7AGAGGA GGTAAGATTA TACAGCTGGA CACC7CG7AA CTTCTCAGTG 

+ 1SIRE ELK RTD T I F W ?GC LLV 
781 TCCATAAGGG AAGAAC7AAA GAGAACCGA7 ACCATTT7CT GGCCAGG7TG TC7CC7GGT7 

+ 1 K R C G G M C A C C L K N C NEC Q C V 
841 AAACGC7G7G G7G3GAACTG TGCCTG7TG7 C7CCA7AA77 GCAA73AATG 7CAATG7C-7C 

♦I P S X V T K K Y H E V L C L R ? X T G V 
901 CCAAGCAAAG T7AC7AAAAA ATACCACGAG G7CG77CA37 7 G A G A C C AAA GACCGG7G7C 

+ 1 R G L H K S L T D V ALE H K E E CDC 
961 AGGGGA77GC ACAAA7CACT CACCGACG7G GCCC7GGA33 ACCA73AGGA G7GTGAC7G7 

> i V C R G S7G G S R G ? F E C- X ? I P N 
10 21 GTG7GCAGAG GGAGCACAGG AGGATCTAGA GGGCCC77 3G AAGC-T.V^GCC TATCCC7AAC 

¥l P L L G LPS T R T G H H H H H H 
10 81 CCTCTCC7CG G77TCGATTC TACGCGTACC GGTCA7CA77 ACCA77ACCA T7GA 
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/Vtp .m?- DNA and polypeptide sequence used for bacuiovirus/insect cell expression 

1 GAATTCAAAG GCCTGTATTT TACTGTTTTC GTAACAGTTT TGTAATAAAA AAACCTATAA 

+ 3 m k £ I v n valv f m v vyi syi 
61 ATATGAAATT CT7AGTCAAC GTTGCCCTTG TTTTTATGG7 CGTATACATT TCTTACATCT 

+ 3 y a DP " S H H K H K H E S NLS SKF 
121 ATGCGGATCC GGAGTCTCAC CATCACCACC ATCATGAATC CAACCTGAGT AGTAAATTCC 

+ 3QF SS N K E Q N G V QDP QHE R I I 
181 AGTTTTCCAG CAACAAGGAA CAGAACGGAG TACAAGATCC TCAGCATGAG AGAATTATTA 

+ 3TVST NGS I H S P RFP H T Y P R N 
241 CTGTGTCTAC TAATGGAAGT ATTCACAGCC CAAGGTTTCC TCATACTTAT CCAAGAAATA 

+ 3TVLV WRL VAVE ENV W I Q LTF 
3 01 CGGTCTTGGT ATGGAGATTA GTAGCAGTAG AGGAAAATGT ATGGATACAA CTTACGTTTG 

+ 3DERF GLE DPED DIG K Y D FVE 

3 61 ATGAAAGATT TGGGCTTGAA GACCCAGAAG ATGACATATG CAAGTATGAT TTTGTAGAAG 

+ 3 VEEP SDG TILG R W C GSG TVP 
421 TTGAGGAACC CAGTGATGGA ACTATATTAG GGCGCTGGTG TGGTTCTGGT ACTGTACCAG 

+ 3GKQI SKG N Q I R I R F V S D EYF 

4 81 GAAAACAGAT TTCTAAAGGA AAT CAAA T T A GG AT AAGATT TGTATCTGAT GAATATTTTC 

+ 3PSEP GFC I K Y N IV M PQF TEA 
541 CTTCTGAACC A.GGG7 T CTG C ATCCACTACA ACATTGTCAT GCCACAATTC ACAGAAGCTG 

+ 3 V S ? S VLP PSAL ? L D L L N N A I 
6 01 TGAG7CCTTC AGTGCTACCC CCTTCAGCT7 7GCCACTGGA CCTGCTTAAT AATG CT AT AA 

+ 3TAFS TLE DLIR Y L E PER WQ .L 
661 CTGCCTTTAG TACCTTGGAA GACCTTATTC GATATCTTGA AC C AG AG AG A TGGCAGTTGG 

+ 3DLED LYR P T W Q LLG K A F V F G 
721 ACTTAGAAGA TCTATATAGG CCAAC7TGGC AACTTCTTGG CAAGGCTTTT GTTTTTGGAA 

+ 3RKSR VVD L N L L TEE VRL YSC 
781 GAAAATCCAG AGTC-GTGGAT CTGAACCTTC TAACAGAGGA GGTAAGATTA TACAGCTGCA 

+ 3TPRM FSV SIRE ELK RTD TIF 
84 1 CACCTCGTAA CTTCTCAGTG TCCATAAGGG AA C .AA C T AAA GAGAACCGAT ACCATTTTCT 

♦ 3 W P G C LLV KRCG GWC ACC LHM 
901 GGCCAC-GTTG TCTCCTGGTT AAACGCTGTG G7GGGAAC7G 7GCC7C77G7 C7CCACAA77 

+ 3 C N E C Q C V F 5 K V 7 K X Y H E V L Q 
961 GCAA7GAA7G TCAA7G7GTC CCAAGCAAA3 77ACTAAAAA A7ACCACGAG G7CC7TCAG7 

♦ 3 L R ? K T G V R G L K X S L T D V ALE 
10 21 TGAGACCAAA GACCGGTGTC AGGGGA7TGC ACAAATCAC7 CACCGACG7G GCCC7GGAC-C 

+3 H K E E SDC VCRG STG G 
1031 ACCATGAGGA CTCTGACTGT G7G7G GAG AG GGAGCACAGG AGGATAGCTC TAGA 



0 



> / 



8 



6 




uence used for E.coli expression 



+ 3 QTN SSS NNWN MMN N W W L G T 

1 CGCAGACTAA TTCGAGCTCG AACAACAACA ACAATAACAA TAACAACAAC CTCGGGATCG 

+ 3 E G R I S E F ESNL SSK FQF SSN 
61 AGGGAAGGAT TTCAGAATTC GAATCCAACC TGAGTAGTAA ATTCCAGTTT TCCAGCAACA 

+ 3KEQN GVQ DPQH ERI ITV STN 
121 AGGAACAGAA CGGAGTACAA GATCCTCAGC ATGAGAGAAT TATTACTGTG TCTACTAATG 

+ 3GSIH SPR FPHT Y P R NTV L V W 
181 GAAGTATTCA CAGCCCAAGG TTTCCTCATA CTTATCCAAG AAATACGGTC TTGGTATGGA 

+ 3RLVA VEE NVWI QLT FDE RFG 
241 GATTAGTAGC AG7AGAGGAA AATGTATGGA TACAACTTAC GTTTGATGAA AGATTTGGGC 

+ 3LEDP HDD ICKY DFV EVE EPS 

3 01 TTGAAGACCC AGAAGATGAC ATATGCAAGT ATGATTTTGT AGAAGTTGAG GAACCCAGTG 

+ 3DGTI LGR WCGS G T V P G K QIS 
361 ATGGAACTAT AT7AGGGCGC TGGTGTGGTT CTGGTACTGT AC CAGGAAAA CAGATTTCTA 

+ 3 K G N Q IRI RFVS D E Y FPS SPG 

4 21 AAGGAAATCA AATTAGGATA AGATTTGTAT CTGATGAATA TTTTCCTTC7 GAACCAGGGT 

+ 3FCIH Y N I VMPQ F7E AVS P S V 
4 81 7CTGCA7CCA CTACAACATT GTCA7GCCAC AA77CACAGA AGC7G7GAG7 CCT7CAG7GC 

+ 3LPPS ALP LDLL N M A I7A FS7 
541 7ACCCCCT7C AGC77TGCCA C7GGACC7GC T7AA7AATGC 7A7AAC7GCC T77AG7ACC7 

+3 L E D L IRY LEPE R W Q LDL HDL 
601 TGGAAGACCT TATTCGATAT C77GAACCAG AGAGA7GGCA G77GGAC77A GAAGA7C7A7 

+ 3YRP7 W Q L LGKA F V F GRK SRV 
661 A7AGGCCAAC T7GGCAAC77 CT7GGCAAGG C7777G7777 7GGAAGAAAA TCCAGAG7GG 

+ 3VDLN LLT EEVR LYS C7P RNF 
721 7GGA7C7GAA CC7TCTAACA GAGGAGG7AA GA77A7ACAG C7GCACACC7 CG7AAC7TCT 

+ 3SVSI REE LKR7 D7I FWP GCL 
731 CAG7G7CCA7 AAGGGAAGAA C7AAAGAGAA CCGA7ACCA7 777C7GGCCA GG77G7C7CC 

+ 3 L V K R CGG NCAC CLH N C N E C Q 
841 7GG77AAACG C7G7GGTGGG AAC7G7GCC7 G77G7C7CCA CAA77GCAA7 GAA7G7CAA7 

+ 3 C V ? S K V 7 K K Y K E V L Q L R P K 7 
901 G7G7CCCAAG CAAAGTTAC7 AAAAAA7ACC ACGAGG7CC7 7CA3T7GAGA CCAAAGACCG 

♦ 3 G V ?. G IKK S L 7 D V A L E H H EEC 
961 C7G7CAGGGG A77GCACAAA 7CAC7CACCG AC07GGCCC7 GGAGCACCA7 GAGGAG7GTG 

♦ 3 0 C V C ?. G S 7 G G H H H H H K 

1021 ACTGTG7G7G CAGAGGGAGC ACAGGAGGAC A7GATCACCA 7CACCA77GA 7C7AGAGTCG 

1081 ACC7G CAGGC AAGC77 
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//& Disulphide-linkcd dimcrisation of VEGF-X 
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DNA and polypeptide sequence used ioxE.coli expression of the PDGF-like domain 

+ 3 MR GSKH HHH HGM ASM 

1 AAGGAGATAT ACATATGCGG GGTTCTCATC ATCATCATCA TCATGGTATG GCTAGCATGA 

+ 3 TGGO OM G RDLY ODD DKD P G R 
61 CTGGTGGACA GCAAATGGGT CGGGATCTGT ACGACGATGA CGATAAGGAT CCGGG AAGAA 

+ 3KSRV VDL NLLT EEV RLY SCT 
121 AATCCAGAGT GGTGGATCTG AACCTTCTAA CAGAGGAGGT AAGATTATAC AGCTGCACAC 

+ 3PRNF SVS IREE LKR TDT IFW 
181 CTCGTAACTT C7CAG7G7CC ATAAGGGAAG AACTAAAGAG AACCGATACC ATTTTCTGGC 

+ 3PGCL LVK RCGG NCA CCL KMC 
241 CAGGT7GTCT C CTGGTT AAA CGCTGTGGTG GGAACTGTGC C7G7TG7C7C CACAATTGCA 

+ 3NECQ CVP SKVT KKY HEV LQ L-. 
301 ATGAATGTCA ATGTGTCCCA AGCAAAGTTA CTAAAAAATA CCACGAGGTC CTTCAGTTGA 

+ 3RPKT GVR.GLKK SLT D V A LEH 
3 61 G AC CAAAGAC CGGTGTCAGG GGATTGCACA AATCAC7CAC CGACGTGGCC CTGGAGCACC 

+ 3HEEC DCV CRGS TGG 
421 ATGAGGAGTG TGACTGTGTG TGCAGAGGGA GCACAGGAGG A7AATGAATT CGAAGC77GA 

4S1 7CCGGC7GC7 AACAAAGCCC 
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DNA and polypeptide sequence used for E,coli expression of the CXJB-like domain 

+ 2 MA MDIG INS DPE SHHH HHH 

1 GGCGATGGCC A7GGATATCG GAATTAATTC GGATCCGGAG TCTCACCATC ACCACCATCA 

+ 2 ESN LSSK F Q F SSN K £ Q N GVQ 
61 TGAATCCAAC CTGAGTAGTA AATT CCAGTT TTCCAGCAAC AAGGAACAGA ACGGAGTACA 

+ 2 DPQ H E R I ITV STN GSIH SPR 
121 AGATCCTCAG CATGAGAGAA TTATTACTGT GTCTACTAAT GGAAGTATTC ACAGCCCAAG 

+ 2 FPH TYPR MTV L V W RLVA VEE 
181 GTTTCCTCAT ACTTATCCAA GAAATACGGT CTTGGTATGG AGATTAGTAG CAG7AGAGGA 

-1-2 NVW IQLT FDE RFG LED? HDD 
241 AAATGTATGG ATACAACTTA CGTTTGATGA AAGATTTGGG CTTGAAGACC CAGAAGA7GA 

+ 2 ICK YDFV EVE EPS DGTI LGR 
3 01 CATATGCAAG TATGATTTTG TAGAAGTTGA GGAACCCAG7 GATGGAACTA TATTAGGGCG 

+ 2 WCG SGTV FGK QIS K G X Q I R I 
3 61 CTGGTGTGGT TCTGGTACTG 7 AC CAGG AAA ACAGATTTCT AAAGG AAA TC AAAT7AGGAT 

+ 2 RFV SDEY FPS EPG F C I H Y N I 
421 AAGATTTGTA 7GTGATGAAT ATTTTCCTTC TGAACCAGGG T7CTGCATCC AC7ACAACAT 

+ 2 VMP CF7E AV 
491 TGTCA7GCCA CAAT7CACAG AAGC7G7GTA G7CGAGCTCC G7 CGA.C AAG C T7GCGGCCGC 



541 AC7CGAGCAC 



Expression of the CUB domain in Exoli 
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Partial intron/exon structure of the VEGF-X gene 



(A) - Genomic DNA sequences of 2 exons determined by sequencing 



C C CC C C 1 1 a t acca t a cag tggtcga t c tgaaccagGGTTCTGCATCCACTACAACATTGTCATGCCACAATTCACAGAAGCTGTG 

AGTCCTTCAGTGCTACCCCCTTCAGCTTTGCCACTGGACCTGCTTAA^ 

TCGATATCTTGAACCAGAGAGArC<;CAGTTGGACTTAGAAGATCTAT^^ 

TTGGAAGAAAATCCAGAGTGGTGGATCTGAACCTTCTAACAGAGGAGGTAAGATTATACAGCTGCACACC 

TCCATAAGGGAAGAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTCTCCTGGTTAAACGCTGTGGTGGGAACTGTGCCTG 
TTGTCTCCACAATTGCAATGAATGTCAATGTGTCCCAAGCAAAGTTACTAAAAAATACCACGAGg t aggtatacaat tttcttttt 
ggttcccttcgggtatttta terete 

aaagccagtcatagacatccgtcgatttttaaaagcggcttactctcatitccctcccagGTCCTTCAGTTGAGACCAA^.GACCGGT 

GTCAGGGGATTGCACAAATCACTCACCGACGTGGCCCTGGAGCACCATGAGGAGTGTGACTGTGTGTGCAGAGGGAGCACAGGAG 

ATAGCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCAT 

CCTTAATCTCAGTTGTTTGCTTCA\GGACCTTTCATCTT 

GAGTTGTGCAACAGCrCTTTTGAG^GGAGGCCTAAAGGACAG^ 

AAATAGATCACCAGCTAGTTTCAGAGT7ACCATGTACGTATTCCACTAGCTGGG7TCTGTATTTCAGTTCTTTCGATACGGCTTAG 

ggtaatgtcagtacaggaaaaa.^actgtgcaagtgagcacctgattccgttgccttggcttaactctaaagctccatgtcctgggc 

ctaaaatcgtataaaatctggattttttttttttttt^ 

tggtttttaaaaaggaactatgttgctatgaattaaact 

aatttctgccatttagaagaagagaactacattcatggtttggaagagata.^ 

tcgataagtcagtttatttgtttc^ttgtgtacatttttatattctccttttgacattataactgttggcttttcta-atcttgtta 
aatatatgtatttttaccaaaggtatttaatattctttttt^ 

acacaattgttatagccagagg^acaaagargatataaaatattgttgctctgacaaaaatacatgtatttcattctcgtatgg'tg 
ctagagttagattaatctgcattttaaaaaactgaattggaata 

tcatatcttccattcctgttattkagatgaaaataaaaagcaacttatgaaagtagacattcagatccagccattac 
tccttttttggggaaatctgagcctagctcagaaaaacataa-^gcacc^ 
tgctgtgcagtaggaacacatcctatttattgtgatgttgtc^tt^ 
tggatatttttatgtacagaagtatgtctcttaagcagttcact^ 

TGCTTGTAAAATGCTTAATATCGTGCCTAGGTTATGTCKjTGAC^ 

GGCTATTTTGGGGAGAAAATTatgtgtgtgcgtgcc caagar t tat t tec tggactc tgagaaaargaaaga t aaa 
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(B) - Location of splice sites within the cDNA sequence 

1 GAATTCGCCC TTTTGTTTAA ACCTTGGGAA CTGGTTCAGG TCCAGGTTTT GCTTTGATCC 

61 TTTTCAAAAA CTGGAGACAC AGAAGAGGGC TCTAGGA.1AA AGTTTTGGAT GGGATTATGT 

121 GGAAACTACC CTGCGATTCT CTGCTGCCAG AGCAGGCTCG GCGCTTCCAC CCCAGTGCAG 

131 CCTTCCCCTG GCGGTGGTGA AAGAGACTCG GGAGTCGCTG CTTCCAAAGT GCCCGCCGTG 

+J MSLFGLLLLTS 
241 AGTGAGCTCT CACCCCAGTC AGCCAAA TGA GCCTCTTCGG GCTTCTCCTG CTGACATCTG 

+3 A LA G Q R Q G T Q A ESN L S S K F Q 
301 CCCTGGCCGG CCAGAGACAG GGGACTCAGG CGGAATCC.VL CCTGAGTAGT AAATTCCAGT 

X E Q N G V Q D P Q HER I I T 
3 61 TTTCCAGCAA CAAGGAACAG AACGGAGTAC AAGATCCTCA GCA TGAGAGA A TTA TTACTG 

+3 V S T N G S I H S P R F P H T Y P R N T 
421 TGTCTA CTAA TGGAAGTATT CA CAG GCCAA GGTTTCCTCA TACTTATCCA AG AAA TACGG 

+3 V L V W R L V A V E E N V W I Q L 7 F D 
431 TGTTGGTATG GAGATTAGTA GCAGTAGAGG AAAATGTATG GA TACAACTT ACGTTTGATG 

+ 3 E R F G LED P E D D I C K Y D F V E V 
541 AAAGA TTTGG GCTTGAA GAC CCAGAAGA7G A CA TA TGCAA GTATGATTTT GTAGAAGTTG 

+ 3 E E P S D G T I L G R W C G S G T V ? G 
601 AGGAA CCCAG TGATGGAACT ATATTAGGGC GCTGGTGZGG TTCTGGTACT GTACCAGGAA 

+ 3 K Q I S K G N Q I R I R F V SUE Y F P 
661 AACAGATTTC TAJUGGAAAT CAAATTAGGA TAAGATTTGT A TCTGA TGAA TATJTTCCTT 



(-3 S E P 



? C I H Y N I VM? Q?T E A V 



721 CTGAACCAC G GTTCTGCATC CACTACAACA TTGTCATGCC ACAATTCACA GAAGCTG7GA 

+ 3SPSV L?P SAL? LDL L N N AIT 
781 GTCCTTCAGT GCTACCCCCT TCAGCTTTGC CACTGGACCT GCTTAA7AAT GC7ATAACTG 

+ 3AFST LED LIRY L E ? E R W OLD 
841 CCTTTAGTAC CTTGGAAGAC CTTATTCGAT ATCTTCAACC AGAGAGA7GG CAGTTGGACT 

+ 3 L E D L Y P 7 W Q L L G K A ? V F G R 

901 TAGAAGATCT ATATAGGGCA ACT7GGCAAC TTCTTCGC^A GGCTT77GTT T77GGAAGAA 

"3 K S R V V D L N" L L 7 S ^ // 
951 AATCCAGAG7 C-G7GGA7C7G AACCT7CTAA CAGAGGAX7 AAGA7TA7AC AGCTGCACAC 

k3 p R N r 5 V S I R E E . L X ?. 7 D T I 7 W 
CTCG7AAC7T CT7AGTG7CC A 7 AA G GG AA G AACTAAAGAG AACCGA7ACC A7TTTCT GGC 



1021 



+ 3 P G C L L V K R C G G MCA C C L H N C 
10 81 CAGGTTGTCT CCTGGTTAAA CGC7G7GC7G GGAACTGTG 7 C7G77G7CTC CACAATTGCA 

+ 3 N E C Q C V P S K V 7 K Y Y H E I V L Q L 
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+ 3RPKT GVR GLHK SLT D V A L E H 
1201 GACCAAAGAC CGG7GTCAGG GGATTGCACA AATCACTCAC CGACGTGGCC CTGGAGCACC 

+ 3HEEC DCV CRGS TGG 
1261 ATGAGGAGTG TGACTGTGTG TGCAGAGGGA GCACAGGAGG ATAGCCGCAT CACCACCAGC 

13 21 AGCTCTTGCC CAGAGCTGTG CAGTGCAGTG GCTGATTCTA TTAGAGAACG TATGCGTTAT 

13 81 CTCCATCCTT AATCTCAGTT GTTTGCTTCA AGGACCTTTC ATCTTCAGGA TTTACAGTGC 

1441 ATTCTGAAAG AGGAGACATC AAACAGAATT AGGAGTTGTG CAACAGCTCT TTTGAGAGGA 

1501 GGCCTAAAGG ACAGGAGAAA AGGTCTTCAA TCGTGGAAAG AAAATTAAAT GTTGTATTAA 

1561 ATAGATCACC AGCTAGTTTC AGAGTTACCA TGTACGTATT CCACTAGCTG GGTTCTGTAT 

1621 TTCAGTTCTT TCGATACGGC TTAGGGTAAT GTCAGTACAG GAAAAAAACT GTGCAAGTGA 

1681 GCACCTGATT CCGTTGCCTT GCTTAACTCT AAAGCTCCAT GTCCTGGGCC TAAAATCGTA 

1741 TAAAATCTGG ATTTTTTTTT TTTTTTTTTG CTCATATTCA CATATGTAAA CCAGAACATT 

1801 CTATGTACTA CPAACCTGGT TTTTAAAAAG GAACTATGTT GCTATGAATT AAACTTGTGT 

1351 CATGCTGATA GGACAGAC7G GA7TTTTCA7 ATTTCTTATT AAAATTTCTG CCATTTAGAA 

1921 GAAGAGAACT ACAITCATGG TTTGGAAGAG ATAAACCTGA AAAGAAGAGT GGCCTTATCT 

19 31 TCACTTTATC GATAAGTCAG TTTATTTGTT TCATTGTGTA CATTTTTATA TTCTCCTTTT 

2041 GACATTATAA CTC-TTGGC77 TTCTAATCTT GTTAAATATA TCTATTTTTA CCAAAGGTAT 

2101 TTAATATTCT 77777A7GAC AACTTAGA7G AACTATTTTT AGCTTGGTAA A7TT7TCTAA 

2161 ACACAATTGT TATAGCCAGA GGAACAAAGA TGATATAAAA TATTG77GCT C7GACAAAAA 

2 221 TACA7G7ATT 7CAT7C7CG7 A7GG7GC7AG AG7TAGA77A ATC7GCA7T7 7AAAAAACTG 

2231 AATTGGAATA GAATTGGTAA G7TGCAAAGA CTTTTTGAAA A7AA7TAAA7 7A7CATATCT 

23 41 TCCAT7CCTG 77A7TGGAGA 7GAAAATAAA AAG CAACTT A TGAAAGTAGA CA7TGAGATC 

2 401 CAGCCA7TAC 7AACCTATTC C7TTTTTGGG GAAATCTGAG CC7AGCTCAG AAAAACATAA 

2 461 AGCACC7TGA AAAAGAC77G GCAGCTTCCT GATAAAGCG7 GC7G7GC7G7 GCAGTAGGAA 

2521 CACATCC7AT 7TAT7G7GA7 G77G7GG777 7ATTA7C77A AACTC7G77C CA7ACACTTG 

2531 TATAAA7ACA 7CCATA7777 7A7GTACAGA AGTA7G7C7C 7TAACCAGTT CACTTATTGT 

2 64 1 ACCTGGAAGG GCGAAT7C7G CAGATATC 
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